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An Optical Apparatus for Very-Small-Angle 
Light Scattering— Design, Analysis 
and Performance 


By J. B. LASTOVKA 
(Manuscript received April 4, 1976) 


We describe an optical apparatus designed and built to extend con- 
ventional light-scattering measurements to the very-small-angle regime. 
The present instrument covers the angular range 0.003° = 6 S 0.15° 
with an instrumental resolution (HWHM) of 0.00045° (1.6 arc seconds), 
and exhibits an exceptionally low stray-light background. The theoretical 
and practical considerations important in achieving this performance are 
analyzed in detail. Besides its primary purpose of studying long-wave- 
length (0.01 cm to 1 cm) thermally driven fluctuations, the present type of 
apparatus should also prove quite useful in other areas where long-wave- 
length perturbations must be probed, such as, (1) holographic and optical 
memory imaging, (12) surface roughness testing, and (tit) index of 
refraction profiling. 


I. INTRODUCTION 


Laser light scattering has, over the past decade, been developed!—3 
into an extremely powerful tool for probing the long-wavelength 
(A; &% 2 X 107° cm to 2 X 107% cm) elementary excitations of liquids, 
gases, and solids. Combined with diffraction grating, Fabry-Perot, or 
optical mixing spectrometers the technique is capable of spanning an 
impressive range of more than 13 decades in energy or frequency 
measurement. Yet, in contrast to what has happened in the field of 
inelastic X-ray scattering,*:> very little has been done to utilize 
very-small-angle (vsa) light scattering to probe longer-wavelength 
(Ay & 10-3 cm to 1 cm) excitations. With a few notable exceptions,*—" 
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most light-scattering experiments have been limited to the scattering- 
angle range @ > 1°. 

There have been a number of reasons for this apparent lack of 
progress in the very-small-angle scattering regime. On the one hand, 
experimentalists in the field, encountering a seemingly divergent stray- 
light level at small angles, have assumed that attempts to work in 
the vsA region would present insurmountable problems. On the other 
hand, there did not appear to be any physical phenomena where the 
important elementary excitations were confined to the corresponding 
longer-wavelength regime. Or, in cases where they were, it seemed that 
the use of more conventional macroscopic experimental techniques 
represented a satisfactory experimental approach. 

Recently, however, there has been a resurgent interest in problems 
involving general hydrodynamic instabilities? both in normal liquids 
and liquid crystals.!%-2 The “critical wavelengths’? involved in the 
onset of these instabilities are, in general, controlled by some macro- 
scopic dimension of the sample chamber and tend to fall in the range 
100 um < A, < 1 em. Light scattering is the only technique offering - 
the possibility of probing these wavelengths without physically disturb- 
ing the sample and with a sensitivity sufficient to detect the thermally 
driven critical fluctuations. However, probing the excitation wave- 
length region 100 um < A < 1 emrequires the capability of resolving and 
detecting the scattered light at very small angles, 0.3° = @ = 0.0038”. 

This paper describes the experimental progress which has been made 
in extending the light-scattering technique to this very-small-angle, 
long-wavelength regime. 

In Section IJ, we describe the physical configuration of a light- 
scattering apparatus that has been constructed for use in the vsa 
region. This section also summarizes the measured performance 
characteristics of the instrument in terms of angular resolution and 
stray light. Section III is a detailed presentation of the basic diffraction 
and aberration considerations that influence the design of a VSA 
light-scattering apparatus. Section IV outlines various empirical obser- 
vations made during the course of construction of the present instru- 
ment, relating to the stray-light behavior of optical components at 
small angles. 


Il. AN APPARATUS FOR VERY-SMALL-ANGLE LIGHT SCATTERING 
2.1 Introduction 

In this section, we present a general description of the physical 
layout and performance of a light-scattering apparatus that has been 
constructed for the vsa regime. The theoretical background and 
practical considerations necessary to analyze the detailed charac- 
teristics of the instrument are deferred to Sections III and [V. Although 
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designed specifically for the study of the Bénard convective instability, 
this apparatus embodies solutions to most of the problems to be 
encountered in the general small-angle light-scattering experiment. 


2.2 Performance goals 


The following performance goals were established for the present 
instrument and evaluated at the various stages of construction and 
modification: 


(t) The ability to make quantitative measurements of both the 
scattered intensity and the temporal intensity autocorrelation 
function for scattering angles ranging from a few mrad down to 
at least 50 wrad. (We will, in general, specify angular deflections 
in prad; Table I lists conversion factors to other common units 
of angular measure.) 

(2t) A stray-light level per coherence solid angle in the scattered 
field (d@,./dQcoxn) that was less than 10-6 of the incident beam 
power. 

(222) An angular instrumental resolution of less than 15 urad. 

(2v) The capability of continuously scanning the instrument over 
a reasonable range in scattering angle without the need for 
realignment. 

(v) The attainment of near-diffraction-limited performance using 
customary spherical optics of reasonable cost. 


Taken individually, each of the above goals can be met or bettered 
by existing optical instruments. To cite just two examples, the 200-inch 
Mount Palomar telescope has a diffraction-limit angular resolution of 
about 0.1 wrad; and, in a typical @ = 90 degrees light-scattering ex- 
periment, the desired stray-light level would be considered a straight- 
forward achievement. Insofar as the angular range is concerned, we 
can easily show that the scattered light observed at these angles is 
contributed by plane-wave components of the refractive-index per- 


Table |— Conversion factors between various common units 
of angular measure 


Deg Rad mrad prad bh Arc Sec 
1 Deg 1 0.0174 17.45 17,453 60 3600 
1 Rad 5.73 1 103 106 | 3488 | 2.06 x 105 
1 mrad 0.0573 10-3 1 103 | 3.438 206.3 
1 prad 5.73 X 1075 10-6 10-3 1 | 0.0034 0.2063 
l are min. 1/60 2.91 X 1074 | 0.291 291 l 60 
1 are sec. 1/3600 4.85 X 107§ | 4.85 X 1073 | 4.848 | 1/60 1 
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turbations in the sample whose wavelengths, A, are given by the 
small-angle Bragg condition 


A = o/8, (1) 


where Xo is the incident-beam wavelength. Therefore, probing the 
scattering-angle range from 50 yrad to 3 mrad gives information about 
Fourier components of the refractive index having wavelengths be- 
tween 1.0 cm and 0.016 cm, respectively. Here we can point out that 
this spatial-frequency region is routinely examined by common inter- 
ferometric checking methods and holographic techniques. 

The instrument described in this paper is unique in that it meets 
all of the performance criteria simultaneously. In being able to probe 
perturbations with wavelengths as long as 1.0 cm, it represents a 
100-fold improvement on previous low-stray-light-level scattering in- 
strumentation, while its small stray-light background gives it a 
1000-fold sensitivity advantage over conventional interferometric and 
holographic equipment. On a per-unit-aperture-size basis, its ability 
to resolve closely spaced faint (10~*) and strong (1) features is about 
50 times better than the Mount Palomar telescope. 


2.3 Optical components and physical configuration of the instrument 

Figure 1 sketches the optical configuration of the most recent 
version of the apparatus designed to meet the performance criteria set 
out in the preceding paragraphs. For brevity, we refer to this par- 
ticular optical system as the MK VI instrument. 


uM Lf 
SV 4 
ice taal es i L, A2 
| | 1 
| | 
l | 
| , ARGON LASER M, 
| 
bees aree | 
x My 
MN L 
| M 
2 
Yeu. 
| — BERS 
ae ——————— 
m 2 
DET l3 4 Me 7? 7 


Fig. 1—Optical component layout of the MK VI small-angle-scattering instru- 
ment. Component sizes and spacings are shown approximately to scale. 
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Before discussing the specific function of the various elements of 
the spectrometer, we present below, for reference purposes, a brief 
description of each of these elements and their mounting following the 
identification scheme used in Fig. 1. Whenever spatial or angular 
displacements are specified, they are to be interpreted according to the 
conventions illustrated in Fig. 2. The ¢ (or 6) and # (or ¢) axes are 
taken to be mutually orthogonal cartesian (angular) coordinates 
perpendicular to the axial ray at the point in question. The £(6) 
direction will always lie in the plane of Fig. 1, the instrument’s tan- 
gential plane, while #7(¢) will denote the vertical or sagittal plane. 
The direction of beam travel defines the local 2 axis. The basic hardware 
components of the MK VI instrument are the following: 


A,;—An adjustable circular diaphragm stop with an aperture 
diameter dai Y 5 mm. 

A»—A fixed, precision-pinhole aperture, daz = 100 um. Age is 
mounted with £ and # vernier adjustments relative to I. 

Argon ion laser—The laser is normally adjusted to provide | be- 
tween 50 mW and 200 mW of output at either Ay = 5145 A or 
do = 5017 A. The laser used has a flat- long radius spherical 
resonator, placed at about } hemispherical spacing, and oscillates 
in TEM 00 modes only. The ouput is a well-collimated beam with 
a slight spheroidal distortion. The beam has a gaussian intensity 
profile with a diameter of 1.4 mm as measured to the 1/e? points. 


iy 


(AXIAL RAY) 






<> 


Fig. 2—Cartesian coordinate system for (z, y, z) showing the cartesian angular 
deflection @ and ¢. 
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DET—A silicon-diode photodetector. The diode used is a photo- 
voltaic device operated without bias as a current source. The 
detector has a 1-cm-diameter active area. 

KE—A precision knife-edge custom-fabricated from neutral-density 
“black glass” plate. The 40-mm-long edge is straight to within 
1 wm and nick free. 

Ly—Plano convex achromat with a focal length fz, = 132 mm. Ly 
and A, share a common mount with vernier 7 and 2 degrees of 
freedom. 

L.—An anastigmatically mounted pair of plano convex achromats 
with an effective focal length fz. = 94.77 mm. Lz has (4, 9, 4, 6, ) 
vernier adjustability. 

L3—An achromatic lens having fr; = 150 mm and a mounted free 
aperture diameter of 35 mm. 

M1, M2, M3, M.—Flat mirrors 13 inches in diameter with \/10 
surface figure. 

M;, M,—Dielectrically coated, concave, spherical mirrors fabricated 
of fused quartz. They have a radius of curvature of 2 manda 
surface conformity of \/10. The mounted free aperture is 6.5 cm 
in diameter. 

M,, M;—Aluminized, first-surface, fused-quartz, flat mirrors. They 
have a mounted free aperture of 13 cm and a surface figure of \/20. 

S—A bilateral slit with straight jaws that can be used to reduce the 
gy dimension of the probe beam. 

SLZ—A commercial, precision, bilateral slit. The jaws have a 50 mm 
usable height and an accurately adjustable opening range from 
3 um to 3 mm. The slit assembly is mounted on a precision x-z 
translational stage positioned by large-barrel micrometer heads 
with a maximum conforming error of about 1 um. The x(6@)-axis 
micrometer can be manually positioned or can be driven by a 
digitally controlled stepping motor. 

SV—The location of the scattering sample. 


These optical components are mounted on a 3-inch-thick aluminum 
slab that forms a stable base for the instrument. Because random 
laboratory air currents and temperature gradients can cause angular 
beam deflections comparable to the instrumental resolution, the entire 
apparatus is covered by an essentially air-tight Plexiglas* enclosure. 


2.4 Functional description of the apparatus 


We can most easily describe the basic optical characteristics of the 
instrument by following the beam path through the system starting 
at the laser source. 


“ Registered trademark of Rohn & Haas Company. 
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Mirrors M1, M2, M3, and M, steer the laser output beam around to 
a spatial filter assembly comprised of A1, Zi, and A». Lens LZ, and pin- 
hole A» form the conventional spatial filter arrangement, while the 
pre-aperture A, serves to block high-angle-beam trash, such as multiple 
reflections in the laser resonator output mirror. The long path length 
through M,—//, and aperture A, also provides a significant reduction 
in laser tube discharge light that would otherwise pass through the 
system. 

The spherically spreading wave coming from A, is recollimated off- 
axis by M;. The collimated beam leaving J/; has a diameter* D(1/e) 
of approximately 1.65 cm. The wave-front planarity of this beam is 
measured and adjusted using a wave-front shearing interferometer 
aligned to give a 7-mm shear in the tangential plane. The tangential 
direction wave-front curvature is reduced to less than \/8 over the 
beam aperture by translating the spatial filter assembly along the 
laser beam (4) axis. It is important to note that the use of this off-axis 
collimation scheme produces a large amount of astigmatism and 
tangential plane coma. As a result, 24 2s not possible to make the probe- 
beam wave fronts straight in both the £ and @ directions simultaneously." 
The alignment procedure just described is intended to give diffraction- 
limited angular resolution in the 6 plane with some sacrifice in ¢ 
direction resolution. 

The collimated probe beam is now sent to the scattering object at 
SV via the flat mirror M/,;. Flat mirror MW; collects the transmitted 
beam and small-angle scattered light and directs them to /¢. 

In the tangential focal plane of spherical mirror M/¢, the directly 
transmitted beam is brought to a vertical (7) line focus at a position 
we define as txz = 0. Light that has been scattered by some angle 6 
is brought to line focus in the same plane, but at a displaced transverse 
position 


xKe(O) = frie tan dD fac), (2) 


where fis = 100 cm is the focal length of M.. Therefore, for sufficiently 
small values of 6, where tan 6 90, angular deflection maps linearly 
into lateral displacement at the focus with a position-angle dispersion 
(PAD) constant given by 


PAD(KE) = tan) = fue = 1 wm/prad. (3) 


The knife-edge KE is located in this tangential focal plane with its 
edge vertical and can be set to intercept the transmitted beam at 


*See Section 3.1 for the definition of these quantities. 
T See Section 3.2. 
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6 = 0 to prevent it from entering the remaining portion of the optical 
system. In normal practice, KE is adjusted to occult all light for which 
6 < 30 to 50 wrad. The position and orientation of KE relative to M, 
is fixed with diffraction-limited accuracy using the standard Foucault 
knife-edge test procedure. 

Lens pair LZ, re-images the focal plane of M, onto the vertically 
oriented main receiving slit SL with a magnification of about (2.54). 
Therefore, the dispersion constant in the slit plane has the value 


PAD(SL) = 2.54 um/urad (4) 
or 


PAD(SL) = 0.0001 in./prad. (5) 


The magnification by Le allows the scattering angle 6 to be read directly 
on the “‘english-units’’ micrometer that positions the slit. More im- 
portantly, it relaxes the stability and accuracy requirements that must 
be imposed on the slit scan mechanism. Since SL has a minimum 
opening setting of roughly 3 um, the slit-limited angular resolution is 
about 1 urad. 

The proper locations and orientations for the main slit SZ and lens 
L, are determined by an iterative procedure in which one of the jaws 
of SL and the zmage of the knife-edge formed by Lz at the slit plane are 
positioned to form an apparent two-jawed slit. The absence of distor- 
tion in the Fraunhofer diffraction pattern formed when this ‘‘slit’’ is 
illuminated by a collimated beam becomes a diffraction-limited test 
for correct lens and slit alignment. 

The scattered light passed by the main slit is collected by Z3 and 
sent to the photodiode DET. The focal length and position of Ls; are 
chosen such that the real image of the limiting aperture of M7, formed 
at the plane Mg by lens L, is re-imaged onto the detectors active area. 


2.5 Observed angular-resolution performance 


We can assess the @ direction angular resolution of the MK VI 
apparatus from measurements of intensity as a function of slit position 
(x) in the absence of a scattering object. Two such “instrumental 
profiles” are shown in Fig. 3. The ordinate scale is logarithmic in the 
detector photocurrent with a rough correspondence of 200 pA/mW of 
optical power. Curves A and B were taken under identical conditions 
except for the position of the knife-edge KE. For curve A, the knife- 
edge was withdrawn to allow the direct probe beam to reach the 
scanning slit, while for curve B, it was positioned to occult all light 
in the region 6 = 50 urad. Note that the use of the knife-edge provides 
a significant decrease in observed stray-light level, the reduction 
amounting to about an order of magnitude improvement for 6 = 600 
urad (see Section IV). 


1232 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1976 


10-4 
5 tg 
10 oc 
uw 
= 
10° << 
= 
a 
ir 
8 « 
10 oc 
© 
ra) 
Ee 
Oo 
a a 
a 


1079 
(8) or? 
1071! 


10712 
1200 1000 800 600 400 200 0 ~200 


0 IN RADIANS 


Fig. 83—Observed instrumental profiles for the MK VI apparatus plotted on a 
logarithmic intensity scale. Curve B was obtained with the knife-edge, KE, occulting 
the direct beam, while curve A was measured with KE retracted. 

An expanded view of the 6 & 0 region of Fig. 3 is shown in Fig. 4. The 
dashed curve represents a best fit of the gaussian exp — [6?/66?(1/e) | 
to the instrumental line shape, as detailed in Section 3.1. The full 
width at half-maximum of the fitted curve is 


A6(4) = 16 urad. (6) 


For the traces shown in Figs. 3 and 4, the main slit width was set at 
5 um which, from eq. (5), is equivalent to a 2-urad acceptance angle. 
Under these conditions, the effect of artificial slit broadening on the 
line shape may be neglected, as outlined in Appendix A. 

Deriving a value for the sagittal, or ¢ direction, resolution is a more 
complicated procedure because of the large instrumental astigmatism 
(see Section 3.2). However, a pragmatic number can be given using 
the following operational definition. If the sagzttal resolution were 
measured in the tangential focal plane of Me, the location of the main 
slit, the instrumental profile would have a full width at half maximum 
given by 

Av(s) = 93 urad. (7) 
(See Section 3.2, especially Figs. 16 and 17.) 

The overall angular resolution characteristics of the instrument are 
illustrated in Fig. 5. This sketch shows various fraction-of-maximum- 
intensity contours for the instrumental profile as determined at the 
main slit plane. 
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Fig. 4—Measured instrumental profiles for the MK VI apparatus in the region 
around @ = 0. Curve B was obtained with the knife-edge, KE, occulting the direct 
beam, while curve A was measured with KE retracted. Dotted curve is best fit of 
the function exp[_ —6?/56?(1/e) ] to the transmitted beam profile. 


The measured profiles presented in Figs. 3 and 4 and the corre- 
sponding contours of Fig. 5 were obtained using the full g-axis beam 
height of the instrument, that is, in the absence of aperturing of the 
probe beam by slit S of Fig. 1. As such, the quoted Ag resolution does 
not include any diffraction broadening associated with g direction 
vignetting of the main beam. At full aperture, the instrument’s 6 
resolution is essentially diffraction limited, while the ¢ resolution is 
dominated by astigmatic blurring. However, as the beam height is 
stopped down, diffraction spreading will eventually override the 
astigmatism and the instrument will be solely diffraction limited. For 
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the MK VI, this crossover point occurs at a beam height of about 
0.5 em or roughly 7 of the full design aperture. Therefore, the most 
advantageous use can be made of the present apparatus when the 
desired probe-beam geometry consists of a collimated “‘sheet”’ or ribbon 
illumination. 


2.6 Analysis of stray-light performance 


A second crucial performance characteristic of any light-scattering 
instrument is its stray-light level in relation to the scattering efficiency 
of the sample under investigation. For the MK VI instrument, the 
ratio of recorded stray-light photocurrent to the photocurrent observed 
at the peak of the transmitted beam, say, can be read directly from 
Figs. 3b and 4b; however, this ratio is not of immediate physical 
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Fig. 5—Contours of constant intensity for the instrumental line shape. Each con- 
tour is labelled in terms of a fraction of the peak intensity, /(@ = 0, ¢ = 0). 
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significance because of the effects of residual instrumental astigmatism. 
The directly measured 7(@)/i(0) ratio requires a certain amount of 
mathematical interpretation to provide the stray-light ratio values 
that will be relevant in signal-to-noise or observability calculations.!—3:27 
In general, the quantities that are most important in this regard are: 
(2) The scattered power per unit solid angle divided by the incident 
probe-beam power: 
i dP;,(9, yg) = dR,(9, ¢) . (8) 
Py dédy dQ 


The ratio [d@,(0, ¢)/dQ] is a frequently used measure of the scattering 
power of an object; calculating a value of this ratio appropriate to the 
stray light [d@.:(@, ¢)/dQ] provides a basis for estimating the ob- 
servability of a particular scattering feature. This quantity can be 
extracted more or less directly from 7(@)/7(0) given (1) the effective 
solid angle subtended by the main slit and detection optics, and (2) 
the instrumental profile contours of Fig. 5. 
(72) The normalized scattered power per scattering normal mode: 


0; (K;) ; 


= (9) 


This latter quantity appears in scattered intensity calculations in 
which the index-of-refraction perturbations in the illuminated scatter- 
ing volume are represented in terms of an orthonormal plane wave 
Fourier expansion.*:?? The mean-square amplitude of these modes and 
their scattering efficiency are, in general, easily calculated from the 
known physical properties of the sample. Characterizing the stray-light 
via a ratio @.:(K;)/@o provides another convenient way of determining 
the observability of the scattering from a particular sample object. 

(722) The normalized scattered power per coherence solid angle in 
the scattered field: 


1 d®.(6, g) _ d&s(6, ¢) 
Po ~ dQcon a qa Qcox. (10) 

This quantity appears in signal-to-noise ratio calculations relevant 
to determining the spectrum of the scattered light from measurements 
of the temporal autocorrelation function or spectrum of the detected 
photocurrent.?:*.?7 In this case, it is useful to also characterize the 
stray-light level in terms of the quantity [d@®.:(@, ¢)/dQcon |. The 
ratio [d®;,/dQcox | differs from that defined in eq. (8) in that the solid 
angle is specified as being the solid angle of spatial coherence in the 
scattered field, Qeoy. The coherence solid angle is a measure of the 
range in 6 and ¢ about some arbitrary reference direction (6, v) over 
which the amplitude and/or phase of the scattered electric field exhibits 
statistically correlated behavior. In the typical light-scattering experi- 
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ment, the extent of spatial coherence in the scattered field is controlled 
by the geometry of the scattering sample and wave diffraction. In a 
first approximation,?’ the coherence solid angle is just the diffraction 
solid angle of the scattering source; that is, 


d nN 
Qcon = Abcon X Agcon = (2) e) ; (11) 
4 


where be and b, are extremal dimensions of the illuminated sample 
volume as viewed from a direction specified by (6, ¢). However, for 
the MK VI instrument, the extent of the spatial coherence is partly 
determined by residual aberration effects, and an evaluation of the 
ratio [d®,:(@, ¢)/dQcox | requires a specific calculation of the spatial- 
coherence properties of the optical field at the main slit plane. 

Obtaining expressions for the various stray-light ratios when aber- 
rations are present requires a rather lengthy detailed analysis, as is 
carried out in Section 3.3. For our purposes here, we merely quote 
those results that lead to the numerical ratios appropriate to the 
MK VI instrument. In each case, the procedure 1s to treat the observed 
stray-light level as if 1t originated from a fictitious ‘sample’ placed 
at the normal position of the scattering volume. After deriving the 
expressions that relate slit-plane intensity to a real sample’s scattering 
cross section, expressed for example as [d@,(6, ¢)/dQ |, we utilize these 
results in reverse fashion to calculate the effective cross section of our 
fictitious stray-light sample. Of course, these expressions derive from 
the main slit-plane imaging characteristics of the instrument; there- 
fore, In succeeding paragraphs, whenever angles or solid angles are 
specified, they are to be interpreted as slit-plane coordinates or areas 
converted to angular units via eq. (5). Consider first the quantity 

ARs (8, y) a 1 dP s1(8, y) 0 
dQceou 7 Po déd i 


1 de;.(8, —- =— 
= P, ae AécouA ¢cou, (12) 


where A@con and Agcoy are the full-width coherence angles in the 6 
and ¢ directions. When the main-slit acceptance angles A@ész and Aggrz 
satisfy the inequalities 
Adsi< Adcor = 24.1 prad 
A vst > A vcou = 140.4 prad, 


as they do for the profiles of interest here, the right-hand side of eq. 
(12) can be expressed in terms of the measured photocurrent, 7(@), as 


(13) 


dRs:(0. ¢ = 0) ~~ 7(6) A ¢cou 
EN es SN 14 
AQcou 2(0) m Agst ’ aa 


where 2(0) is the photocurrent observed at the peak of the direct 
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transmitted beam. In writing eq. (14), we have assumed that Aggr 
is symmetrically placed around ¢ = 0. The ratio [Agsz/Agcor | is 
essentially the number of slit-plane coherence areas sampled by the 
detection optics. For the instrumental profiles shown in Figs. (3) and 
(4), Agsz was limited solely by the free aperture of lens L3. Using the 
proper lens free aperture diameter and the linear dispersion constant 
given in eq. (5), we find an effective slit-acceptance angle 


Ags, = 1.4 X 104 wrad = 0.79°. 


This value of Aggsz corresponds to a slit height that samples approxi- 
mately 100 coherence areas. The ratio [Agsr/Agcon |, eq. (14), and 
the data of Fig. 3 combine to give the [d@;.(0, 0) /dQceox | values listed 
in Table IT. 

The normalized stray-light power per mode can be found from 
[d@s.(6, 0)/dQcox | by the methods detailed in Section 3.3. The basic 
procedure involves calculating both the scattered power per coherence 
area and @,(K,;) from a common starting point to obtain the correction 
term that relates them. In the present case, the required relationship 
has the form 


dRs+(6, 0) = P(K;j) ¥, Agcon Adcoun (15) 
dQcou Po (A/by) (A/bz) ’ 
where 6, and 6, are the clear aperture width and height of the instru- 
ment and d is the optical wavelength. The product of the ratios 








A goon Aécon 


O76) 82 Gon) 


is a weighted measure of the number of K; modes contributing to the 
power observed in a single coherence area at the main slit. At the full 
aperture of the MK VI instrument, 6, = 5 cm and b, = 5 cm, the 
correction factor has the value 














A vcon Acou 
= 33.6. 16 
(\/b,) O/bs) 16) 
Table 11—Numerical values of various stray-light ratios for the 


MK VI instrument at selected scattering angles 


é-yrad 7(0)/2(0) ARs: (0, 0)/dQcox Pst(K;)/ Po 
50 1.07 X 1074 1.5 X 1078 4.5 X 1078 
100 2.82 X 1075 4.0 X 1077 1.2 X 1078 
200 9.55 XK 10-6 1.4 X 1077 4.0 X 10° 
200 2.82 X 1076 4.0 x 1078 1.2 K 10° 
1000 1.62 * 1076 2.6 X 107° 6.8 X 10°” 


Ads, = 2urad, Ags, = 14,000 wrad, A@con = 24.1 wrad, and Agcow = 140.4 prad. 
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Combining eqs. (15) and (16) with the values of [d®,:(0, 0)/dQcox | 
already calculated gives the stray-light-per-mode ratios to be found 
in Table II. 

The numerical values of the various stray-light ratios may be put 
into perspective by calculating the amplitude of some physical per- 
turbations that would generate a scattered intensity equal to the 
observed stray-light level. Based on a theoretical analysis of the 
scattering problem, it may be shown that, for sufficiently small 
scattering angles, the actual three-dimensional scattering volume can 
be taken to be equivalent to a two-dimensional phase-object placed 
normal to the incoming probe beam.”® In this two-dimensional phase- 
plate equivalent, the scattering disturbances appear in the form of a 
spatially varying phase thickness ¥(2, y), which is the line integral of 
the instantaneous index of refraction encountered by a ray traversing 
the actual sample at the lateral position (a, y). If n(z,.y, 2) is the local 
index of refraction in the actual three-dimensional scattering problem 
then y(xz, y) is given by 


Lz 
vo, 0) == [ n@, u, ade (17) 


where L, is the length of the illuminated volume along the direction of 
the incident beam. The phase perturbation y(z, y) may be represented 
in terms of a two-dimensional plane-wave Fourier expansion 


W(2, y) = DY VK eeu, (18) 


with the K; = (K,, K,) chosen to make the expansion functions 
orthonormal over the instrument’s full aperture. In this formulation 
of the problem, the normalized scattered power per K; mode has the 
simple form 





,(K; 2 
pe = HIVE) I, (19) 
where the angular brackets denote an appropriate time or ensemble 


average. 

The expression for the scattered power given in eq. (19) may be 
used to interpret the stray-light levels observed in the MK VI instru- 
ment in terms of a minimum detectable amplitude for a specific physi- 
cal scattering mechanism. In succeeding paragraphs, we consider three 
such scattering processes: (7) static index of refraction modulation in 
a transparent slab, (77) surface height modulation on a reflecting 
mirror, and (777) temperature modulation in an otherwise homogeneous 
liquid. 

2.6.1 Refractive modulation in a slab 

The scattering from a static sinusoidal refractive-index modulation 
in a plate is an interesting model problem relevant to holographic 
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memories and general phase-grating problems. We take the sample 
object to be a nominally homogeneous plate of thickness L, and 
refractive index mo in which a small sinusoidal index disturbance 


én(x, y, 2) = dne*@? (20) 


has been created. The index perturbation is assumed to be uniform in 
the @ direction such that Q lies in the (xz, y) plane. The local index in 
the plate is 


n(x, Y, 2) = No E + a eer] (21) 

0 
which, when inserted in eq. (17), gives the phase function y(a, y) as 
va, u) = 2 [mole + ane] (22) 


The required Fourier amplitude, (|y(K,) |2), is obtained by inspection 


from eq. (22) as 
v@l) = | (52)(2) nor. (23) 


This result may be used together with eq. (19) to obtain the pertur- 
bation amplitude (6n/no) necessary to produce a given scattered power 
per mode. For example, taking L, = 1 cm, mo = 1.5, and A» = 5000 A, 
we find that a refractive-index amplitude 6n/no = 1.2 X 107° yields a 
normalized scattered power per mode equal to the MK VI’s observed 
stray-light value at @ = 100 urad. To produce scattering at this angle, 
the wavelength of the perturbation A = 27/|Q| would have to be 
A = (0/0) = 0.5 ecm. Table III lists the “‘background equivalent” 
6n/no values corresponding to other values of @(A). 


2.6.2 Height modulation on a reflecting surface 


Another interesting example from the viewpoint of stray-light level 
comparison is the scattering from a surface height disturbance on an 
otherwise perfect reflecting mirror. Clearly, this problem can also 
serve as a model for calculating the instrumental background when 
mirror surface roughness (see Section IV) is the dominant source of 
stray light. 

Since the primary effect of a surface height deviation is to produce 
a phase perturbation on the reflected wave-front, the phase function 
(x, y) can be written down immediately as 


Ha, y) = DCD nx, 9), (24) 


where h(x, y) gives the local physical height displacement from the 
nominally perfect geometric surface. 
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Table !Il—-Amplitudes of three scattering perturbations 
necessary to scatter an amount of light equal to 
the instrument’s stray-light level 


9 rn Refractive Surface Temperature 
(urad) (cm) O,(K;)/ Po Perturbation Corrugation Perturbation 
- (6n/no) (6h-A) (67'-°C) 

50 1.0 4.5 X 1078 2.2 X 10° 0.16 6.8 X 1078 

100 0.5 1.2 X 1078 1.2 x 10° 0.084 3.0 X 1076 

200 0.25 4.0 X 107° 6.7 X 107° 0.049 2.0 X 1078 

500 0.1 1.2 X 10° 3.6 X 107% 0.027 1.1 X 1078 
1000 0.05 6.8 X 1071 2.8 X 1071° 0.021 8.4 XK 1077 


We will take h(x, y) to be a small, static, sinusoidal corrugation 
h(x, y) = 6h et? (25) 


for which the phase function is just 
(x, y) = > — et, (26) 
0 


The Fourier amplitude (|¥(K,) |?) follows trivially as 
ae 4 
(vax) = | an | 
0 


Combining eqs. (27) and (19) with the data of Table II gives the 
“background equivalent” surface corrugation amplitudes listed in 
Table III. Again, these are the surface amplitudes necessary to yield 
a normalized scattered power-per-mode equal to the MK VI’s stray- 
light level. For example, when A = 27/|Q| = 0.5 cm, the “background 
equivalent”’ corrugation has an amplitude of 6h = 0.084 A or, in the 
usual surface-figure parlance, 


sh & /60,000. 


2 


(27) 


2.6.3 Temperature modulation in a liquid 


As a final example, we consider an otherwise homogeneous slab of 
liquid of thickness L, on which is impressed a small sinusoidal tem- 
perature disturbance, 


é6T (x, y, 2) = 6Te*@, (28) 


with Q lying in the (x, y) plane. The calculation of the scattering from 
such an object is really just a simple extension of the result obtained 
above for refractive index modulation. The temperature perturbation 
produces an associated index disturbance that is responsible for the 
scattering. If the temperature perturbation in eq. (28) is impressed 
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isobarically, the associated index modulation is simply 
in(x, v2) = (5p) oT Ha 


on 
1Q+ ro 
= (7), dT e (29) 


Equations (20) through (23) may now be used to obtain the relevant 
mean-square phase amplitude, namely, 


(1¥(Q) |?) = ks =( 2) rar] (30) 


Taking LZ, = 1 cm and A, = 5000 A and using a typical value for 
(dn/O0T)p in liquids, (dn/dT)p =—5 X 10-*4/°C, we find the back- 
ground-equivalent temperature amplitudes listed in Table III. 


2.7 Conclusion 


In this section, we described the basic features of an optical instru- 
ment capable of extending conventional light-scattering measurements 
to an angular range (50 wrad to 3 mrad) not previously accessible. In 
addition to a diffraction-limited angular resolution of a few seconds of 
arc, the MK VI instrument exhibits an exceptionally low stray-light 
background making it an effective tool for probing small-amplitude- 
scattering processes. Besides its primary purpose of studying long- 
wavelength (0.01 cm to 1 cm) thermal fluctuations, the present type 
of apparatus should prove quite useful in other areas where long-wave- 
length perturbations must be probed, such as, 


(<) Holographic and optical memory imaging. 
(22) Surface roughness testing. 
(272) Index of refraction profiling. 


In general, the MK VI offers a sensitivity improvement of a factor 
of about 1000 over the instrumentation normally used for such 
measurements. 

While we have given a rather broad overview of the apparatus in 
the present section, we have not attempted to present the fundamental 
considerations on which the design is based. We refer the reader who 
is interested in these questions to the remaining sections of this paper. 


Hl. THEORETICAL CONSIDERATIONS IN THE DESIGN OF A 
VERY-SMALL-ANGLE LIGHT-SCATTERING APPARATUS 
3.1 Aperture apodization 


In the simplest analysis, the ultimate angular resolution of any 
optical instrument is limited solely by diffraction. The expression most 


1242 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1976 


widely used to estimate the limiting resolution is the so-called Rayleigh 
criterion,” 
(A9) (Ad) & Xo. (31) 


The quantities A@ and Ad can be interpreted in two ways: 


(2) If Ad is the diameter of a “‘collimated”’ beam, then A@ is the 
actual angular spread of the beam imposed by diffraction. 

(12) If A@ is the collection angle for light emanating from an object, 
then Ad is the smallest spatial detail that can be resolved on 
that object. 


Neglecting for the moment the off-axis features of the actual MK VI 
instrument, we can duplicate its basic function with the two-lens 
system sketched in Fig. 6. Applying the Rayleigh criterion to this 
particular optical arrangement for a collimated beam diameter 
b = Ad = 5 cm, and with A», = 5000 A, predicts an instrumental 
angular spread 


Aé@payvieicn = 10 urad. (32) 


Unfortunately, taken by itself, this value for A@gayiricy contributes 
little in the way of a quantitative understanding of the instrument’s 
small-angle performance. In fact, the Rayleigh criterion can be mis- 
leading in a number of ways. First, it does not indicate how much light 
an object would have to scatter to be ‘‘visible’’ when the scattering 
angle approaches Aégayrercu. Second, it implies that the angular 
diffraction spread can be decreased to an arbitrarily small value by 
simply increasing the beam diameter b = Ad. In reality, the presence 
of unavoidable optical aberrations will always limit the attainable 
angular resolution. In designing an instrument which is to attain a 
resolution approaching the diffraction limit, a quantitative approach 
to the problem is mandatory. 






SPATIAL [~ 
FILTER 


ls SCATTERING 
OBJECT 


APERTURE 
PLANE 


Fig. 6—A simplified “lens equivalent’’ version of the MK VI optical system. 


LIGHT SCATTERING 1243 


An exact expression for the diffraction-limit resolution charac- 
teristics of an optical system may be obtained as follows. Let us assume 
we know the spatial dependence of the incident beam intensity on 
some surface in the optical system, say the aperture plane of Fig. 6. 
In our case, the electric field on this surface is of the form 


Hio(a, yer torr aat), (33) 


This aperture plane field can be decomposed into a set of infinitely 
extended plane waves, 


Ep(6, 9) = E(8, ve~@r-e0d), (34) 


propagating toward Le at various angles with respect to the 4 axis.’ 
The plane-wave amplitudes, 43(6, ¢), are found from the Fourier 
integral 


E3(0, ¢) = a J J dvdyEo(x, y)ei(e27t@u) (35) 
oJ Js 
together with the relations, 
1Q| =k = 2m/Xo 
Q,=Q-4=kosiné= ko (36) 
WH = 0:9 = kosin o=—kog. 


Equation (35) 1s just a slightly modified form of the usual scalar 
diffraction theory result which utilizes spherically spreading waves as 
basis functions.” 

Assume for the moment that lens Lz in Fig. 6 is infinitely large and 
free of aberrations. Then each of the plane waves, Hpn(@, ¢), is brought 


to a point focus in the slit plane at a position 
é= ftand= fo 

37 

n= ftang=S fe, 30) 


where f is the focal length of Ze, and the approximate signs hold for 
small angles. Combining (85), (86), and (87) gives the field in the 
slit plane as 


Kot é, n) = it If Ev(2, yet @nlfro) (etm dardy. (38) 

Pro J Js 
As eq. (38) shows, the field at the slit plane and the field at the aperture 
plane are related as Fourier transform pairs. It should be evident 


that eq. (38) can also be applied “backwards” in Fig. 6 to relate 
E(x, y) to the field at the spatial filter aperture, Eo(é’, 7’). We will 


* The cartesian coordinate and angle notation follows that adopted in Section IT. 
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refer to the field Eo(x, y) or its intensity 


T(x, y) = 3Veco/molo(x, y) L(x, y) 


as the illumination or aperture function. The slit-plane field Eo(&, 7) 
or its intensity, Jo(£, 7), is the corresponding instrumental profile. 

The procedure of aperture apodizing may be described simply as 
follows. The basic problem is to find and implement an instrumental 
illumination function, such that both the function itself and its trans- 
form have minimum spatial extent. The goal in a loose sense is to 
optimize the angular resolution per unit aperture opening. Of course, 
one of the general properties of Fourier transform pairs is that the 
second moments or ‘‘widths”’ of the pair members have an approximate 
inverse relationship. The Rayleigh criterion, in fact, is a simplified 
statement of this property. Even within the confines of this inverse 
relationship, however, there is still wide latitude for aperture apodizing, 
l.e., Shaping the instrumental profile to obtain particularly desirable 
angular or spatial characteristics. Although the Fourier transform 
relationship between the illumination function and the instrumental 
profile in coherently illuminated optical systems is well known,*—® 
aperture apodizing schemes are not often applied in optical instrument 
design. Apodizing schemes are, however, extensively employed in 
high-frequency and microwave antenna design,*4*> where they are 
used to create antenna systems exhibiting an angular directivity 
pattern that satisfies a particular objective. 

In designing an apparatus for very-small-angle light scattering, the 
principal objective is the ability to observe the weak scattered light 
in close angular proximity to the unscattered beam. The goal, then, is 
an instrumental profile that not only has small angular half-power 
points but, more importantly, continues down rapidly to the 107° to 
10-* level. The proper shaping of the illumination function, Ho(x, y) is 
absolutely crucial in obtaining this desired ‘“‘steep-skirt’’ behavior. 

In treating the question of aperture apodization for the MK VI 
instrument, we consider the optical system in the simplified form 
shown in Fig. 6. Therefore, the calculated instrumental profiles that 
are obtained below represent the instrument’s ideal, diffraction-limited 
performance in the absence of all aberrations. The ways in which the 
residual aberrations of the actual off-axis configuration modify these 
results are taken up in detail in Section 3.2. 

Given the idealized geometry of Fig. 6, the process of evaluating 
various illumination function/instrumental profile combinations can 
be further simplified by the following considerations. First, most of 
the interesting illumination functions and, therefore, their Fourler 
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transforms can be factored to the form 


Fio(x, y) = Eo(x) Ely) 

Bot, n) = Eo £) Eo(n). 
Second, in the MK VI apparatus, the open height of the main scanning 
slit guarantees that the measured slit-plane profile is the integral over 
all » of the slit-plane intensity Jo(£, 7). Under either of these circum- 
stances, we need consider only a one-dimensional form of eq. (88), 
namely, 


(39) 


a 
(fro)? 


where & = /f@. The corresponding aperture and slit-plane intensities are 
Iola) = 5 4/222 Hox) B3(2) 
Ho 


ae. €€0 1 
Io(£) = DA ae 


In presenting the results of calculations based on eq. (41), it is 
convenient to adopt a concise terminology to describe the spatial and 
angular widths of the functions involved. We use the following 
notation: 


D,(4)—Full width at half-maximum intensity for Jo(x) 

d.(3)—Half width at half-maximum intensity for [o(x) 

D;($)—Full width at half-maximum intensity for Io(£) or Io(£’) 

d:(3)—Half width at half-maximum intensity for Io(é) or Io(£’) 

Aé($)—Full width at half-maximum intensity for Jo(£) or Io(£), 
expressed as an angular equivalent via eq. (387), A@(3) 
= (1/f)D:(2) 

60(4)—Half width at half-maximum intensity for Jo(£) or Io(£), 
expressed as an angular equivalent. 


E,(£) = J Eo(a)et@n 0 eda, (40) 


(41) 





| E/o(a)et er! fro) Ezy 


For arguments other than (3) these quantities, give the width at the 
specified fraction of the peak intensity. Since the absolute normaliza- 
tion of the various intensity functions depends only on the total beam 
power, we present all results in terms of the ratio quantities: 


Io(x)/To(0), Lo()/To(0), Lo(@)/To(0), 
where J,(@) describes the slit-plane intensity with position & given in 
terms of the equivalent angular deflection 6 = &/f. 
Figure 7 shows the instrumental profiles calculated for two interest- 
ing illumination functions. The first is the uniform field 
_ (Eo; —b/2< 2s /2 
Box) = 1: otherwise 
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Fig. 7—Calculated instrumental profiles for uniform and gaussian illumination 
with b = 5.0 cm and o chosen to give equal values of 6@(3) for both profiles. The 
normalized illumination functions for each case are shown inset on a linear intensity 
scale. 


for which one can easily calculate the following intensity ratios: 


T9(x)/Io(0) = i —b/2 <2 <b/2 


otherwise 
_ sin? (kobE/2f) 
Io(&)/Lo(0) = “(kobe 272 (42) 
10(6)/9(0) = 22 C8) 


(gb /2)2 


where ky = (27/Xo). The second profile results from a gaussian aperture 
illumination, 


Eo(x) =H 5 e— 27/20? 
for which the relevant intensity ratios are 


Io(x)/Io(0) = exp (— 2?/o?) 
Io(£)/Io(0) = exp (— oko &/f?) (43) 
Io(0)/Io(0) = exp (— ok@6?). 
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The numerical parameters used in obtaining the curves plotted in 
Fig. 7 were 


ho = 5000 A, b= 5.0 cm, o = 1.496 cm. (44) 


The b value is representative of the maximum clear aperture of the 
MK VI apparatus; the value of o was arbitrarily chosen to give equal 
66(4) for both instrumental profiles. Also shown inset in Fig. 7 are 
the two aperture-plane intensity ratios, Io(x)/I (0). Note that the 
latter are plotted using a linear ordinate scale. 

As is evident from these two calculated instrumental line shapes, 
the use of gaussian apodization is vastly superior to uniform illumi- 
nation in regard to the observability of weak small-angle features even 
though the two J(@)/Io(0) profiles have identical half-widths. For 
example, at the 10—-° level we find | 


60(10-*) = 19.8 urad GaussIAN I(x) 
60(10-*) = 3183 uwrad uniFoRM I(z). 


In fact, from a theoretical standpoint, the gaussian is the ideal form 
of aperture functional. Among the families of possible illumination 
functions, it possesses a unique combination of two properties: (7) 
it has an extremely rapid shirt fall-off, and (72) it goes over into itself 
under the Fourier transform operation. In a general situation where the 
available aperture illumination has some arbitrary (x, y) behavior, 
gaussian apodization would have to be accomplished by interposing a 
suitable neutral density mask at the aperture plane. Fortunately, laser 
sources with a reasonable cavity configuration and oscillating only on 
TEM oo modes have an output beam intensity pattern which is accu- 
rately gaussian, except in the extreme tails of the profile. The avail- 
ability of such a source represents a crucial factor in the feasibility of 
constructing an instrument having the resolution and stray-light per- 
formance of the MK VI apparatus. 

In the actual MK VI instrument, the ratio of the focal length of 
lens £, to that of mirror M; was chosen to generate a gaussian illumi- 
nation function with an effective width 


o* = d,(1/e) = 0.826 cm (45) 


in the collimated beam portion of the apparatus (see Fig. 1). The 
instrumental profile calculated via eq. (41) for this value of o is plotted 
in Fig. 8. Also shown for comparison purposes is the profile to be 
expected if we uniformly illuminated the instrument’s maximum 
design aperture 

b* = 5.0 cm. 
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It is clear that for these specific values of o and b, gaussian apodizing 
no longer exhibits an absolute superiority over uniform illumination. 
Although the ‘‘gaussian”’ profile still reaches the 10—§ level much more 
rapidly, it does sacrifice resolving power to the “‘uniform”’ profile down 
to approximately the 10~? level. What this means to vsa scattering 
performance is the following. The “‘gaussian”’ instrument will excel in 
its ability to detect small amounts of light at very small scattering 
angles; however, it will not resolve approximately equal intensity 
features with as much detail as would the “‘uniform”’ instrument. As 
we see in the following paragraphs, the tradeoff, roughly speaking, 
involves paying for small-angle weak-intensity performance by sacri- 
ficing some ability to resolve the angular dependent features of the 
scattered light. This comparison can be made more quantitative by 
reference to Table IV, which gives various half-width angles for the 
profiles of Fig. 8. 
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Fig. 8—Calculated instrumental profiles for uniform and gaussian illumination 
with b = 5.0 cm and o = 0.826 cm. The 6 value corresponds to the maximum clear 
aperture of the MK VI instrument, while the o value corresponds to the width of the 
gaussian illumination actually used in the present apparatus. Normalized illumination 
functions for each case are shown inset on a linear intensity scale. 
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Table I¥V— Calculated instrumental profile half-widths at various 
fractions of peak Intensity for three different 
illumination functions 


Gaussian Uniform Truncated Gaussian 
: Illumination Illumination Illumination 
Ao = SOOO A is (1/e) = o* dz 1G ee b*/2 b=b* ao =a" 
= 0.826 cm 5.0 cm b/c = 6.05 
(urad) (urad) (urad) 
60(1/2) 8.0 4.43 8.09 
50(1/e) 9.6 5.23 9.69 
69(107) 14.6 7.36 14.62 
66(1072) 20.7 31.8 20.6 
66(1073) 25.3 100.6 25.5 
60(10-*) 29.2 318 29.7 
66(107-5) 32.7 1006 32.6 
66(1075) 35.8 3183 76.8 
60(10-7) 38.7 10,060 243 
66(10-8) 41.3 31,830 768 


From the inset plots of [(x)/I(0) shown in Fig. 8, it may seem that 
the gaussian illumination profile used in the present apparatus was 
unnecessarily narrowed relative to the instrumental full aperture. This 
is, In fact, not the case. One crucial detail which has been omitted in 
obtaining the results presented in Figs. 7 and 8 is the possible vignetting 
effect of the instrument’s maximum aperture. In calculating /(£) for 
the gaussian ’)(x), for example, the integral in eq. (40) was taken over 
all a, thereby neglecting any aperturing effects that might occur. 

For the actual vsa scattering instrument, which has a fixed maximum 
aperture, b, Ho(£), and /o(x) are related via the finite domain transform 


oo 
(fro)? 


Except for a few special cases, an analytical evaluation of this integral 
is not possible, and one must resort to a numerical approach to investi- 
gate various apodizing schemes. For the experimentally relevant case 
of gaussian illumination, eq. (46) becomes 


b/2 
L£(é) = cate Eo(a)et@r! fro) kad, (46) 


E,() = fo : i ent 207+i Qn] froizd a (47) 
(fro)? J2/2 


On completing the square in the exponential and a change of variable, 
we can rewrite this expression in the form 


E,(é) = V2ce~ (#12) a  e-utdy, (48) 


tir a z 
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where K, w, w;,, and w_ are defined as follows: 


w= 2. 77H 
Vio 
On 
J (49) 
_ b ok 
MET OGD 
bg K 
iO ae 


The analyticity of exp (—w?) near w = 0 allows the complex plane 
w integral to be split into two terms, each having the form of an error 
function of complex argument. Tabulated values of this function are 
available in the literature*® for a restricted range of the parameters 
(b/c) and (cK). 

In searching out an optimum configuration for the MK VI instru- 
ment, it was decidely more convenient to adopt a fully numerical 
approach in evaluating eq. (47). Appendix B outlines the methods 
that were used. The modified instrumental profile calculations were 
carried out for a range of values of the ratio (b/c) with the aperture 
opening, b, held fixed at b = b* = 5.0 cm. 

Figure 9 shows four such profiles plotted in terms of the normalized 
intensity ratio Jy(@)/I9(0). Also shown are the corresponding aperture 
ratios Io(a)/I9(0). The curve for (b/c) = 0.01 is essentially equivalent 
to the result obtained above for uniform aperture illumination. The 
most striking feature of the remaining three J(6)/Jo(0) curves is the 
presence of an effective background or floor contribution to the profile 
caused by edge diffraction at the aperture. This “‘shelf”’ or wing on the 
profile has the slow oscillatory decay of a (sin? x)/z? functional de- 
pendence. In each case, however, the 60 portion of the curves 
closely approximates the gaussian profile expected from unapertured 
gaussian illumination. 

The results given in Fig. 9 clearly illustrate the tradeoff involved 
in selecting a value of c. In circumstances requiring an instrumental 
line shape with a very low background level, we are forced to accept a 
moderate increase in 60(3) and, therefore, a loss in angular resolving 
power. The curve given in Fig. 9 for (b/c) = 6.05 corresponds to the 
choice that was made for the MK VI apparatus. Various half-width 
values for this profile have been included in Table IV for comparison 
with the results for unapertured gaussian and uniform illumination. 
In the actual instrument, this choice for (b/c) guarantees that the 
calculated edge-diffraction ‘‘floor’’ constitutes less than 10 percent of 
the overall stray-light level. This point is illustrated in Fig. 10 which 


LIGHT SCATTERING 1251 


ro-tL WS, 
Wel + 


i 
10-24 | 
t 
i 


\ . b/a = 0.01 

| haa r 

i iN W\ A Nis risa 
Mi Vf \f Vi f me ‘a 





—e eee coe 


' 
10-3L. | 
! 
i 
| 
| 
| 






t 
{ \ 
| | WA \ 
ror yl | | WW ALA ati 10-4 
| i in| 1 Wi \I 
fey eae YY 
10 | toy i M ! 107 
yk 1 
5 | i | | b/o = 6.05 i S 
‘o 1 a =0.826 cm 1: fo 
Seca TATA AC Ss 
= iim! | | 1 = 
me ie! | 1 i W f 10-7 — 
in| | 
10-8 | 10-® 
: t 
i | 
-~9 H fi _9 
19 lee bio =8.33—/ * | _ 
i ara o = 0.600 cm ! 
—10 ii ae Vail? - 
ie eter | 10-19 
10711 i a ae “11 
PPP ad | ‘ 
be 
iat? 10-12 
0 2 4240 #460 80 4100 120 3 2 1 0 


6 IN “RADIANS xX 1N cm 


Fig. 9—Calculated instrumental profiles for truncated gaussian illumination and | 
various values of (b/c). Also shown, on a logarithmic intensity scale, are the cor- 
responding normalized illumination functions [Jo(x)/Zo(0)]. The curves labelled 
(b/c) = 6.05 are appropriate to the b and o values used in the present apparatus. 
shows the theoretical profile for (b/c) = 6.05 superimposed on the 
measured profiles of the MK IV instrument. 

At this point, it is crucial to realize that the truncated transform 
results apply not only downstream from the aperture plane of Fig. 6 
but also upstream toward the spatial filter. There are, in fact, two 
other possible sources of beam vignetting in the system. The most 
obvious is the spatial filter itself. Since the field at the spatial filter 
and the field at the aperture plane are Fourier transform pairs, the 
same considerations involved in choosing (b/c) also apply to the choice 
of spatial filter pinhole size. If edge-diffraction effects at the aperture 
plane are to dominate the system profile, then the ratio of pinhole 
diameter, bey, to the gaussian focal width at the pinhole, d;-(1/e), 
must exceed (b/c). Specifically, for the present instrument, bp must 
satisfy the inequality 





bp b os bw 
jou (50) 


bpy > 58 um. 
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The pinhole diameter actually used is bp>y = 100 um. We note that a 
[bpy/d;-(1/e) | ratio this large is contrary to usual spatial filtering 
practices. 

A much more subtle source of possible beam aperturing is the internal 
cavity configuration of the laser source itself. Clearly, the ratio of 
laser tube inside diameter, byaser, to the mode (1/e) radius, cyaser, 
must also satisfy the inequality 


DLasER > oO (51) 

OLASER o 
The laser used in the MK VI has a gaussian mode diameter given as 
Dyaser(1/e?) = 1.4mm, (52) 
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Fig. 10—Calculated instrumental profile for truncated gaussian illumination 


(dashed curve) superimposed on the measured profiles of the MK VI instrument. 
Curves A and B correspond to the two measurements described in Fig. 4. 
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which is equivalent to 

D 1/e? 

OLASER — Prasen(T/e') ~ 0.5 mm. (53) 

The plasma tube 1p for the laser is bhasen = 5.0 mm yielding a b/c 
ratio 

basen = 10. (54) 

OLASER 
Therefore, in the MK VI instrument, the maximum design aperture 
available to the collimated beam is, in fact, the principal source of 
truncation effects. 

In situations where it is advantageous to alter this resolution-back- 
ground tradeoff by varying (b/c), it soon becomes apparent that the 
numerical profile calculations of Appendix B are a rather unwieldy 
design tool. Instead, based on an examination of the results shown in 
Fig. 9, it seemed tempting to fit the profile tails to the form 


In(@) _ sin? (kob8/2) 
(0) ~*~ (K9b8/2)? 





(55) 


and look for an interpolation formula relating the amplitude A, to 
the ratio (b/c). By a trial-and-error procedure, the following relation 
was found to reproduce the best-fit A, values to within 10-percent error: 


2 
ain (14%) mee 50) 


Table V gives the fitted and interpolated values of Ai corresponding 
to the four (b/c) ratios of Fig. 9. It is interesting to note that the 
exponential factor exp (—6?/40?), which dominates the (b/c) de- 
pendence, is just the normalized aperture illumination at the aper- 
ture edge. 


3.2 Optical aberrations 


The fundamental diffraction limitations set out in Section 3.1 are 
really only a prediction regarding the ideal performance of an optical 
system. In the final analysis, the inherent optical aberrations of any 
particular apparatus design determine how close one will come to 
achieving the ideal of diffraction-limited performance. In this section, 
we give a brief summary of those aspects of optical aberration theory*’ 
that are relevant to the design of the MK VI apparatus. From a 
qualitative understanding of and analytical expressions for each of the 
various aberrations, we then determine the extent to which aberrations 
modify the ideal diffraction-limited characteristics of the instrument. 
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Table V—Fractional amplitude of the edge diffraction contribu- 
tion to the instrumental profile of truncated gaussian illumi- 
nation. The first column gives the A; values obtained by 
fitting eq. (55) to the tails of the profiles shown in 
Fig. 9; the second column gives the A; value 
predicted by the interpolation formula 


in eq. (56) 
(b/c) A,-Fitted A;-Interpolated 
0 1.0 1.0 
3.333 1.33 x 107 1.49 X 107 
6.05 5.823 X 107% 5.92 X 107 
8.333 2.782 X 1077 2.79 X 1077 


Finally, a number of measurements taken on the MK VI instrument 
are compared to the quantitative predictions of the aberration theory. 

Because the present instrument is illuminated with monochromatic 
light, the various chromatic aberrations are absent, and the lowest- 
order non-zero distortions come from the third-order or primary 
aberrations. Here we follow the order-naming convention associated 
with the Taylor expansion of the function sin y, ie., 


3 5 
iny=y-S+8, (57) 


where y is the angle of incidence of a ray on a reflecting or refracting 
surface. The approximation sin y = yw leads to the usual paraxial optics 
formulae. The next term in the expansion, proportional to y, describes 
the primary aberrations. 

The principle aberration-producing elements of the MK VI ap- 
paratus are the off-axis spherical mirrors Ms; and Mg (see Fig. 1). 
Figure 11 shows the basic optical configuration in which the mirrors 
are used. The labelled geometrical parameters are: 


Uy = z —The half-field angle or off-axis angle 


@—The semi-aperture (58) 
R—The mirror radius of curvature. 


The primary aberrations for an off-axis spherical mirror depend 
parametrically on two angles: uy, the half-field angle and (@/R), the 
semi-aperture angle. The aberrations associated with the various 
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Fig. 11—Off-axis mirror configuration used in the MK VI instrument showing the 
important geometrical parameters. 


third-order products of these angles are: 
(@/R)*—Spherical aberration 
(@/R)2u,—Coma 
(@/R)uz—Astigmatism 
u3—Distortion. 


(59) 


3.2.1 Spherical aberration 


Spherical aberration is a longitudinal focussing defect that is present 
even when the off-axis angle goes to zero. Figure 12 sketches the basic 
ray geometry for a spherical mirror exhibiting pure spherical aber- 
ration. As illustrated in the enlarged detail of the sketch, the marginal 
rays of an incoming parallel bundle are brought to a focus at a point 
closer to the mirror’s surface than those lying nearer the axial ray. 
This constantly changing longitudinal focal position results in a trans- 
versely smeared focal spot rather than a focal point. 

One common measure of the amount of spherical aberration is the 
minimum beam waist size produced in the focal region. For a spherical 
reflector, the diameter of this blur spot is given by 


2TSC* = @3/R?. (60) 


Since transverse displacement at the focus is equivalent to an angular 

deviation in the parallel bundle, we can also express the spherical 

aberration in terms of a full-width angular blur, 

20'SC* 
f 


where f = F&/2 is the focal length of the reflector. 





Absc = = 2(@/h)*, (61) 


3.2.2 Astigmatism 


Astigmatism, like spherical aberration, is the result of a longitudinal 
focussing defect. In contrast to the spherical aberration defect, how- 
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ever, the longitudinal focussing error depends not on aperture diameter, 
2@, but on the off-axis angle, wu». Figure 13 illustrates the ray geometry 
of pure astigmatism for a spherical reflector. 

One of the fundamental characteristics of the w, dependent aber- 
rations is the loss of rotational symmetry in the focal region. A non- 
zero off-axis angle destroys this symmetry and establishes two unique 
directions or planes of transverse blurring. The plane defined by the 
incident and reflected axial ray is the tangential plane. Cartesian or 
angular displacements perpendicular to the axial ray and lying in this 
plane are referred to as tangential displacements. The two planes 
orthogonal to this tangential surface and containing either the incident 
or reflected axial ray are called the sagittal planes. Cartesian or angular 
displacements from the axial ray in these planes are sagittal 
displacements. 

For a spherical reflector exhibiting pure astigmatism, a fan of 
parallel tangential plane rays are brought to a focus closer to the mirror 
surface than an identical sagittal fan. The focal region pattern found 
by decomposing the entire illuminated aperture into such ray fans 
consists of the two longitudinally separated focal lines depicted in 
Fig. 13a. The longitudinal (§) separation of the S and T foci (2AC*) 
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Fig. 12—Ray diagram for a single spherical mirror exhibiting a pure spherical 
aberration defect. 
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Fig. 13—Partial ray diagram illustrating the focal plane characteristics of an off-axis spherical mirror exhibiting the pure astigma- 


tism defect. 
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can be calculated from the relation 
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x 
ee 


(62) 
and depends only on the semi-field angle. 

In the absence of other aberrations (and the effects of diffraction), 
the S and T' focal lines are infinitely narrow in their respective planes. 
This means that the sagittal height of the tangential focus and the 
tangential width of the sagittal focus can be obtained from simple 
extremal ray geometry. For example, Fig. 13b shows the extremal rays 
seen in a tangential plane projection. Normally 2A C* is small compared 
to the reflectors focal length (#/2). It follows that the lengths of the 
two focal lines are identical and given by 


2 
cs se inden ieee 2 
2TAC* = 2AC ( : 3) 2au2. (63) 
The full-width angular spread equivalent to this spatial blur is 
Aérace = 4(@/R)ui. (64) 


3.2.3 Coma 


When the off-axis angle is non-zero, the longitudinal focussing error 
that produces spherical aberration also gives rise to an asymmetric 
transverse blurring called coma. Figure 14a sketches the basic elements 
of the focal region pattern for a spherical reflector exhibiting a pure 
coma defect, Rays in the paraxial region are brought to a focus at the 
axial focus, P, while rays from larger-diameter annular zones on the 
mirror’s surface form focal circles whose centers are tangentially dis- 
placed from P. The radius of a particular focal circle increases as the 
square of the radius of the zone producing it. 

Figure 14b gives a qualitative representation of the characteristics 
of the focal pattern as found by dividing up the illuminated aperture 
into these annular zones. Mathematically speaking, the focal circles 
are not sharp unless the radial thickness of the corresponding zones 
vanish; however, the sketch does predict quite nicely the overall 
exterior outline of the coma blur patch. 

The continuum of focal circles nest into a 60° wedge extending out 
from the axial focus forming a pattern commonly called comatic flare. 
The largest-diameter focal circle, produced by the annular zone at the 
edge of the illuminated aperture, has a radius CC* given by 


QU p 


CO" = OR 


(65) 
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(a) 


Fig. 14—Partial ray diagram, (A), of an off-axis spherical mirror exhibiting a 


pure coma defect. The details of the focal plane pattern are illustrated in (B). 


Its center is tangentially displaced from the axial focus by an amount 


2CC*, 


Although coma is a highly asymmetric aberration, it is still con- 
venient to specify its effect in terms of transverse and/or angular blur 
sizes. The numbers ordinarily quoted for coma correspond to the 
extremal dimensions of the coma patch in the tangential and sagittal 
directions. It follows easily from Fig. 14b that the full-width trans- 


verse spatial blurs are 





; 2 
TI pirEcTION 27CC* = 3C0C* = sau 
re 2 

S DIRECTION 2SCC* = 2CC* = ame 


The equivalent full-width blur angles are 


T DIRECTION AOrec = 3(@/R)2u> 
S DIRECTION Agco = 2(@/R)?up. 
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(66) 


(67) 


3.2.4 Distortion 

The last of the primary aberrations is a defect of off-axis magnifi- 
cation called distortion. This particular aberration is associated princi- 
pally with optical systems that form a real image of an extended object 
at finite magnification. For example, the image of a rectangular grid 
of regularly spaced points will exhibit the classic “barrel’’ or “‘pin- 
cushion”’ appearance in an optical system involving pure distortion. 
For the situation of interest here, namely a spherical reflector with 
the image at infinity or at the focus, the amplitude of the pure distor- 
tion aberration vanishes identically. 


3.2.5 Application of the aberration results to the MK IV instrument 

The third-order aberration theory results for the off-axis spherical 
reflector are summarized in Table VI which gives the expressions for 
the various transverse and angular blurs. It must be emphasized that 
the aberration theory results outlined above are derived from purely 
geometric ray tracing. In no sense does this theory predict the actual 
intensity distribution in the image plane for a specific aperture illumi- 
nation. The transverse and angular blur patterns define the outlines 
of a boundary between focal illumination and strict geometric shadow 
in the absence of all wave interference and diffraction effects. However, 
in certain situations, it is possible to combine the geometric aberration 
results with aberration-free diffraction calculations to obtain useful 
instrumental profile information. The approach works well when one 
or more of the following conditions are satisfied : 


(7) Diffraction blurring is large or small compared to spherical 
aberration. 
(22) One primary aberration is dominant. 
(1zt) The ideal diffraction-limited system profile is free of large 
interference maxima and minima. 


Table VI— Summary of the analytic expressions for the 
transverse and angular aberration blurs for a single 
off-axis spherical mirror 


Aberration Full-Width Transverse Blur Full-Width Angular Blur 
Spherical @3/R?2 (2 TSC") 2(@/R)8 
Coma T -3@%u,/2R (3 CC*) —3(@/R)up 
S —2@u,/2R (2CC") —2(@/R) up 
Astigmatism 2auz, (2 TAC") 4(@/R)u3 
Distortion BuiR 2 Bus 
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In these cases, one can intuitively construct the aberration-affected 
profiles with a fair degree of accuracy using the one-to-one geometric 
mapping of regions of the aperture onto the focal plane. For example, 
it can be argued from Fig. 138b that for pure astigmatism the tangential 
direction intensity profile at the sagittal focus should be a demagnified 
replica of the aperture illumination. This type of analysis is essential 
in obtaining quantitative results from the aberration expressions. 

In the MK VI instrument, the off-axis angle and radii of mirrors 
M, and M, are 

y = 2|u,| = 0.116 rad 


P= 200-em. (68) 


Assigning a value to be used for the semi-aperture @ is a more subtle 
question, especially since we are interested in gaussian rather than 
uniform aperture illumination. However, in the spirit of the astigma- 
tism example given in the preceding paragraph, we take 


@ = d,(1/e) = o* = 0.826 cm (69) 


and assume that the focal plane profiles will also be gaussian. On the 
basis of geometrical imaging, the blur values calculated via Table VI 
should then be (with the exception of coma) the full-width to the 
(1/e) points of a focal plane gaussian profile. We show in succeeding 
paragraphs that these assumptions lead to a selfconsistent picture 
of the experimentally observed aberration effects in the MK VI 
instrument. 

Table VII gives the transverse and angular blurs for a single spherical 
reflector in the MK VI configuration. The table also includes the 
longitudinal separation of the S and T focal planes as well as the full 
angular width of the focus imposed by diffraction. From the viewpoint 
of small-angle-scattering performance, the most serious of the aber- 


Table VIl— Numerical values of the aberration blurs for a single 
spherical mirror used in the MK VI configuration 


Reais pe paca aa ae Blur 
Spherical 0.141 0.141 
Coma T 2.96 2.96 
S 1.97 1.97 
Astigmatism 55.6 55.6 
Distortion 0 0 


(S-7')separation = 2 AC* = 0.336 cm. A@ (1/ €)diffraction = 19.3 prad. 
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Table VIIl— Numerical values of the aberration blurs for the 
system of two off-axis spherical mirrors used 
in the MK VI apparatus 


‘Aberaton ea ae Full-Wid a at a Blur 
Spherical 0.282 0.282 
Coma T 0 0 
S 0 0 
Astigmatism 111.2 111.2 
Distortion 0 0 


(S-T separation => 2(2 AC") = 0.672 cm. A6(1/é)aittraction = 19.3 prad. 


rations 1s coma. Even though the calculated comatic blurs are numeri- 
cally small compared to the diffraction spread, the presence of coma 
can result in a distinctly asymmetric instrumental profile. Moreover, 
in a coherently illuminated system, the coma flare is criss-crossed by 
interference patterns whose tails extend far beyond the calculated 
geometric limits. This latter effect can significantly raise the effective 
“‘floor’’ level of the instrumental profile. 

Fortunately, in a symmetric two-mirror system like the MK VI, 
the geometry may be chosen such that the total coma vanishes identi- 
cally. In fact, all aberrations that depend on an odd power of the half- 
field angle wu, disappear if the field angles at the two elements are 
made equal and of opposite sign. By convention, wu, 1s defined as the 
angle through which the incoming axial ray must be rotated to bring 
it into coincidence with the local radius vector of the element’s spheri- 
cal surface (see Fig. 8). An inspection of Fig. 1 shows that in the MK VI 
apparatus the field angle rotations at 17; and M, are of opposite sense. 
In this case, when the two off-axis angles are made equal in magnitude 
the coma and distortion aberrations vanish while the spherical aber- 
ration and astigmatism double. 

The total calculated aberration blurs for the instrument are sum- 
marized in Table VIII. Since the spherical aberration is small com- 
pared to the diffraction spread, it 1s reasonable to expect that the 
interpretation of these blur values as (1/e) full-widths of a gaussian 
blur profile should work quite well. This is in fact the case. 

The tangential and sagittal foci of the MK VI instrument were 
located using a modified Foucault knife-edge procedure, and 7 direc- 
tion scans of the intensity profiles were taken in each case. The mea- 
sured S-T separation was 


(S-7') separation = 0.660 + 0.013 cm. (70) 
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Fig. 15—Observed instrumental profile as measured by a tangential direction 
scan in the instrument’s tangential focal plane. The heavy dots are a best fit to the 
function exp [ —6?/66?(1/e) ]. 


At the T focus, only diffraction and spherical aberration contribute 
to the profile width. From the standard gaussian convolution formula, 
we can calculate the expected A@(1/e): 


T PLANE ~ (19.30)? + (0.282)? 
P scan Ad(1/e) = v(19.30)? + (0.282)? uwrad 
= 19.302 prad. (71) 


Clearly the spherical aberration has a negligible effect on the ideal 
diffraction-limited broadening. Figure 15 shows a typical high-resolu- 
tion T-plane scan for the instrument. The large apparent noise in this 
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trace is produced by residual air currents and vibration in the ap- 
paratus and corresponds to a peak-to-peak beam wander of roughly 
0.5 wrad (0.7 arc second). The results of fitting the observed profile 
with a gaussian shape are indicated by the points in Fig. 15 and give 
an experimental full width 


Ad(1/e) = 260(1/e) = 19.2 prad. (72) 


In the S focal plane, diffraction, astigmatism, and spherical aberration 
all contribute to the instrumental line shape. The full-width calculated 
from Table VIII is 


S PLANE 
IT SCAN 


A@(1/e) = V(19.3)? + (0.282)? + (111.2)? wrad 


113 prad. (73) 


The predicted width comes predominately from the astigmatic blur- 
ring. Figures 16 and 17 give two experimental S-plane profiles recorded 
with logarithmic and linear intensity scales, respectively. A gaussian 
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Fig. 16—Observed instrumental profile as measured by a tangential direction scan 
in the instrument’s sagittal focal plane. The heavy dots and dashed curve a are 
best fit to the function exp [— ¢°/6¢*(1/e) ]. 
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Fig. 17—Observed instrumental profile as measured by a tangential direction scan 
in the instrument’s sagittal focal plane. The heavy dots are a best fit to the function 
exp L— ¢?/6¢?(1/e)]. 


fit to the logarithmic curve, indicated by the points in Fig. 16, gives 
very good agreement with the observed line shape over roughly four 
orders of magnitude in intensity. The range and precision of the fit 
provide strong support to our assumptions regarding the interpretation 
of the aberration blur values. The best-fit half-width values for the 
logarithmic and linear scans are 66(1/e) = 57 wrad and 66(1/e) = 55 
urad, respectively. The mean observed full-width 


Aé(1/e) = 112 prad 


is in excellent agreement with the calculated value given in eq. (73). 
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3.2.6 Conclusions 


In summarizing the discussion of aberrations, a number of general 
points deserve to be made and reiterated regarding the relationship 
between aberrations and small-angle-scattering performance. 


(2) The cancellation of the asymmetric aberrations, coma especially, 
is crucial in obtaining an instrumental profile that has symmetry, the 
necessary steep skirt fall-off, and a low background value. 

(27) Spherical aberration, although it has a negligible effect relative 
to diffraction in the present instrument, can rapidly grow to serious 
proportions with increasing aperture size, @ = d,(1/e). The angular 
blur of this aberration, A@sc, increases as the cube of the aperture size 
while the diffraction spread varies inversely with d,(1/e). The relative 
contribution of spherical aberration to the profile width will, therefore, 
increase as @*. Since Aés- depends only on the reduced quantity (@/R), 
however, a constant ratio of spherical blur to diffraction spread can 
always be obtained by scaling the mirror radius R to keep @*/R? 
constant. For example, an instrument with 10 times better angular 
resolution than the MK VI might conceivably utilize 80-cm-diameter 
mirrors with a 21.5-meter focal length. 

(212) The presence of a large residual astigmatism need not be 
detrimental if one is satisfied with an instrumental performance that 
is diffraction limited in only a single angular direction. It might appear 
from Fig. 13 that simultaneous sagittal and tangential resolution could 
be achieved by placing separate slits at the S and T foci of the collecting 
mirror. This is true if the wavefronts of the incoming ray bundle are 
perfectly parallel. However, in a two-mirror symmetric apparatus, such 
a bundle cannot be produced because of the collimating mirror astig- 
matism. For example, with reference to Fig. 1, the spatial filter pinhole, 
As, can be placed at either the 7’ or S focus of mirror MW. In the first 
case, the wavefronts of the beam travelling toward M; are tangentially 
collimated but sagittally curved; in the second case, the converse is 
true. From the viewpoint of light-scattering kinematics, this ‘‘col- 
limated” beam will be able to conserve momentum with a relatively 
broad range of scattering vectors lying in the plane containing the 
waveiront curvature. Thus, even though mirror M, forms S and T 
focal lines of equal sharpness, only a single high-resolution axis actually 
exists for either position of pinhole Ag. 

In the MK VI instrument, the spatial filter pinhole is at the T 
focus of the collimating mirror so that the probe beam is tangentially 
collimated. The nature of the wavefront curvature in the sagittal plane 
may be calculated in a simple fashion from the known (S-T) separation 
2AC*, Since the pinhole (at the T focus) is closer to the mirror’s surface 
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Fig. 18—Illustration of the wavefront curvature existing in the off-axis mirror 
collimating arm of the MK VI instrument. 


than the S focus, the sagittal plane wavefronts appear to diverge from 
a point source lying behind the mirror. The object distance, ¢g, between 
this virtual point source and the mirror, may be calculated via the 
usual paraxial formula. Since (S-7) = 2AC* is small compared to the 
focal length of Ms, we have 


ee ee a 

1 = 3AC* ~ FGAC*) fe) 
Inserting the appropriate numerical values, & = 200 cm and 2AC* 
= 0.336 cm, gives 


q = 3.98 X 104 cm. (75) 


Figure 18 shows a sketch of the probe-beam constant-phase surfaces 
with the sagittal curvature greatly exaggerated for clarity. The magni- 
tude of the curvature can be specified in terms of the longitudinal 
spatial separation between the wavefront and a reference plane which 
is tangent to the wavefront at the axial ray. Since the virtual sagittal 
source point lies so far behind the mirror, it does not matter exactly 
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where in the optical path we calculate this separation. The maximum 
deviation between the wavefront surface and the reference plane, Vaz, 
occurs at the sagittal extremes of the beam and is easily found to be 
given by 


Vz = —- (76) 


If we take @ = b*/2, where b* = 5.0 cmis the maximum clear aperture 
diameter in the MK VI instrument, we have 


Vz = 10.5 X 10-° cm, 


or roughly 2A of peak deviation from perfect collimation. In a situation 
where the full sagittal (7) aperture height need not be used, it is 
possible to reduce this deviation substantially since Vz is propor- 
tional to @?. For example, if we aperture the height of the probe beam 
to 6b = 5mm, the peak wave-front deviation is reduced to Vz = \/50, 
or essentially perfect collimation. Thus, the most desirable probe- 
beam configuration in the MK VI instrument corresponds to a “‘flat 
ribbon” or “‘sheet”’ type of illumination. 


3.3 Scattered field intensity, spatial coherence, and scattering kinematics 
in the presence of aberrations 

In a light-scattering optical system whose angular resolution capa- 
bilities are in some respect dominated by aberration effects—for 
example, the astigmatic ¢ blurring in the present instance—we find 
that other important properties of the observed scattered field are 
modified by the aberrations as well. In this section, we examine three 
aspects of normal light-scattering theory that are qualitatively altered 
by the presence of aberrations: 


(2) The form of the spatial coherence function for the scattered 
field. 
(2t) The application of the normal kinematic restrictions (or wave 
vector conservation conditions) in the scattering process. 
(222) The calculation of the amplitude of the perturbations that 
give rise to observed levels of scattered or stray light. 


The effects of aberrations in all three cases have a straightforward 
physical interpretation connected with the fact that light scattered 
into a specific direction (@, g) is no longer brought to a diffraction- 
limited spot focus at the observation plane. 

In typical calculations of the scattered field, in which an incoming 
plane wave is assumed to impinge on the sample, the far-field angular 
distribution of scattered intensity is shown to be simply the spatial 
Fourier transform of the refractive-index perturbations in the illumi- 
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nated volume.!? The light observed at a particular angular position 
(8, ¢) is contributed by a Fourier component of the refractive index K 
given by 


K =k, — ko, (77) 
where ko is the wave vector of the incident beam and k, points in the 
direction of observation (6, y) and has a magnitude |k,.| = |ko| 


= 2r/\o. For a finite illuminated volume, the refractive-index per- 
turbations are most usefully represented in terms of a plane-wave 
Fourier expansion 


> dn; exp (7K;-r’), (78) 
j 


with the K; chosen to make the expansion functions orthonormal over 
the scattering volume.”’ The scattering of a collimated incident beam 
by this assembly of plane waves consists of a family of diffracted 
beams that originate from the K,’s satisfying the Bragg condition, 
eq. (77). On the surface of a sphere in the far field, these diffracted 
beams form a contiguous but essentially nonoverlapping series of 
diffraction “‘spots,’’ each associated with a particular K;. In the usual 
situation, where the amplitudes of the individual K; disturbances are 
statistically independent, these patterns also delineate areas or solid 
angles of statistical field correlation. If the far-field scattered radiation 
is focussed onto the observation plane by an ideal lens or mirror, this 
contiguous angular distribution of ‘‘spots’’ is imaged one-for-one onto 
the focal plane. A ray penetrating the reference sphere at an angular 
position (6, ¢) is imaged onto the focal plane at a transverse position 
(€, 7), where, in the small angle limit, 


g= ff, n= fe. (79) 


In this ideal situation, the intensity observed at some (£, 7) 1s scattered 
essentially by a single K; plane-wave mode. The measured intensity 
may, in theory, be used to calculate the mean-square-amplitude of 
the mode, or vice-versa. Furthermore, the spatial coherence properties 
of the field at the observation plane are determined uniquely by the 
angular distribution of intensity within one of the diffracted beams. 

Formally speaking, the presence of aberrations in the imaging of 
- the reference sphere scattered field produces qualitatively the same 
effects as any other imperfect focussing of the far-field pattern. The 
pattern of diffraction spots will be formed with a degree of spot 
broadening and overlap that depends on the nature and extent of the 
focussing defect. The scattered light reaching a specific (£, ) point 
at the observation plane is no longer associated with a single K; 
disturbance, but is an appropriately weighted sum of contributions 
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from a number of modes. As a result, the Bragg condition, eq. (77), 
is not strictly applicable in relating a particular (£, 7) observation 
point to a specific plane-wave disturbance. The aberration or defocus- 
sing effect must be understood in detail before the measured intensity, 
or its time evolution may be used to infer the physical behavior of 
modes responsible for the scattering. Under defocussed conditions the 
spatial correlation function 1s also modified, though its functional form 
does retain a close resemblance to the intensity pattern associated 
with a single K;, diffraction ‘‘spot.”’ In the following paragraphs we 
consider how the residual astigmatic blurring in the MK VI instrument 
affects the three slit-plane field properties enumerated in the opening 
paragraph. 


3.3.1 Kinematic relations 


To understand how the wave vector conservation criterion 1s to be 
applied at the slit plane of the present apparatus, we need to know 
(t) the slit-plane intensity pattern formed by scattering from a single 
plane-wave disturbance, and (27) the relative positioning of the spots 
from the various allowed K;. In the MK VI instrument, the intensity 
pattern associated with a single K; is identical to the diffraction- and 
aberration-affected instrumental profile whose properties were treated 
in detail in Sections 3.1 and 3.2. At the slit plane, therefore, the single 
K; diffraction spots have the elongated gaussian shape depicted in 
Fig. 5. Expressed in terms of angular coordinates via eq. (79), the 
normalized intensity distribution within a ‘‘spot”’ is simply 


16, 9) _ O05)... toe, 
I(6;, 91) oP S 5@?(1/e) oe 52(1/e) (80) 


The reference point (6;, 9;) specifies the angular position of the spot 
center, which, in the present case, is correctly predicted by the Bragg 
condition, eq. (77). 

Given a correct form for the intensity distribution within a single 
K; pattern, we must still determine the slit-plane spacing of the spots 
associated with the family of allowed K;. Clearly, this spacing depends 
on the reciprocal lattice of the orthonormal expansion functions 
exp (7K;-r’) which, in turn, is fixed by the geometry of the scat- 
tering volume. For sufficiently small scattering angles, the actual 
three-dimensional scattering sample can be taken to be equivalent to 
a two-dimensional phase object placed normal to the incoming probe 
beam.”® The scattering disturbances in this “‘phase-sheet’’ may be rep- 
resented in terms of a two-dimensional plane-wave Fourier expansion 


2 2 ->-exp (2K22) exp (7K,y), 
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with the (K,, Ky) chosen to make the expansion functions orthonormal 
over the instrument’s full aperture. For a rectangular aperture with 
full-width x and y dimensions 0, and b,, the allowed K; can be obtained 
from the cyclic boundary condition relations 


Q2rm 27TNn 





K.(m)=—=—, KK, (n) = (81) 
b,, by 
where m and n are the integers 
m,n =O, +1, +2,::-. (82) 


The scattering angles (@;, 9;) for the central ray of the diffracted beam 
produced by a particular K(m, n) are then given by the small-angle 
Bragg conditions, 





OC oe ee om 
- (83) 
Tv 
K,(n) = hoon = 22. 
¥y 


It follows that the family of diffracted beams are brought to a focus 
at the slit plane on the vertices of a rectangular mesh whose grid 
spacings are given by 


Esp = fOsp = s(3) (84) 
nse = fosr = f( 72): (85) 


For the MK VI instrument at full aperture (b, = 5.0 cm, by = 5.0 cm), 
the equivalent angular mesh spacings are 


Asp = gsP = 10 urad. (86) 


If we imagine the instrumental profile contours of Fig. 5 arranged on 
such a mesh, there will be little overlap in the @ direction but consider- 
able overlap along ¢. The light received at some (0, ¢) point in the 
slit plane will contain contributions from roughly 10 distinct K(m, 7), 
each having the same m(K,) index but differing n(K,) components. 
Because the MK VI instrument is capable of probing 6 values so close 
to the diffraction limit, corresponding to a very small m index, 
m = 5-800, these multiple contributions can prove a serious problem. 
This point is illustrated in Fig. 19, which shows the (1/e) contour of 
a single K(m, n) intensity pattern centered at 6 = 80 uwrad, » = 0 
superimposed on the slit-plane mesh of the (@m, gn). The @ and ¢ axes 
of the figure can also be labelled in terms of the wave vector com- 
ponents K, and K, to which the angles are directly proportional as 
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Fig. 19—Resolution function of the MK VI instrument superimposed on the 
slit-plane mesh points corresponding to the allowed scattering vectors K;. 


indicated in eq. (83). For the eleven mesh points falling within the 
contour, the magnitude of the associated K(m, n) varies considerably. 
For the point at the center of the contour, we have (m = 8,n = Q) and 


|K| = 22[(m/bz)? + (n/b,)? |? = 10.0 em-, 
while for the points at the y extremes (m = 8, n = 5), we find 
|K| = 11.9 em—. 


When the physical properties of the modes are strongly |K| dependent, 
this overlap can lead to a difficult task in the interpretation of the 
measured intensity and/or its time dependence. 

Clearly, the |K| smearing effect becomes less significant as 6 in- 
creases. Less obvious is the fact that the problem of multiple K; 
contributions can be alleviated by stopping down the beam height of 
the instrument, by. As was pointed out in the conclusion of Section 
3.2, the cylindrical distortion of the probe-beam wave fronts, which is 
a manifestation of the collimating mirror’s astigmatism, can be made 
negligibly small by reducing b,. With b, S$ 0.5 cm, for example, the 
beam incident on the sample can be considered as collimated to within 
the diffraction limit. In this case, the usual kinematic conditions apply 
in relation to the far-field scattered light; that is, the far-field array of 
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“spots”? now form a contiguous and nonoverlapping pattern. Of course, 
each spot is elongated along ¢ by diffraction-spreading because of the 
imposed asymmetry of the probe-beam dimensions, 1.e, bz = 5.0 em and 
b, S$ 0.5 cm, but the angular spot spacings @sp = (A/bz) and 
gsp = (A/by) are correspondingly asymmetric. Given this particular 
situation in the far field, we must still consider the effect of the astig- 
matism associated with the light-collecting mirror. As b, is decreased, 
the ¢ direction astigmatic blurring at the slit plane is decreased pro- 
portionally,* while ¢ blurring due to diffraction increases. At some 
point, a crossover occurs beyond which diffraction spread dominates 
the slit-plane imaging. In this limit, the collecting mirror appears 
aberration free and the far-field pattern of K, spots undergoes the 
normal one-to-one, no-overlap mapping onto the slit plane. For the 
MK VI configuration, the crossover occurs at b, 0.5 em or roughly 
zo of the full design aperture height. Of course, it should be noted that 
under fully diffraction-limited conditions (b: = 5.0 cm, by < 0.5 cm), 
the instrument retains its very asymmetric resolution profile. What we 
have done is to make the instrument appear to be in “‘good focus’’ by 
introducing a sufficient amount of ¢ direction diffraction spreading to 
swamp the aberration defocussing. The price paid for this is that the 
instrument becomes incapable of probing scattering disturbances 
having as long a g direction wavelength (that is as small a value of K,) 
as can be resolved in the £ direction. 


3.3.2 Relation between the slit-plane intensity and the scattering 
cross-section of the sample 

In the absence of aberrations or other defocussing problems, the 
far-field or observation plane scattered intensity can be related easily 
to the mean-square amplitude of the scattering perturbations using the 
standard integral expression for the scattered field.!:3-?7 In the presence 
of imaging aberrations, the total scattered power per plane-wave mode 
is unchanged ; however, now the calculation of the slit-plane intensity 
is complicated by the overlap of the various K; diffraction spots. For 
the MK VI instrument, a relation between the scattered power per 
mode and the observed slit-plane intensity may be obtained as follows. 

The slit-plane intensity corresponding to a single K; disturbance 
can be written down formally as 


|Enm(9, ¢) |? 
(0 = a | (¢ es On)? 
~ 2 - eye 2 
| Enm(Om, Yn) | exp 662(1/e) exp 5 ¢2(1/e) (87) 
with slit-plane position specified in angular units. The position of the 


*See Figure 13 and the discussion pertaining to this figure. 
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central ray of the pattern (8m, gn) is given by eq. (83). The total 
scattered power in the single K; pattern is just 


J [d0de| Bam, ¢) |? 


760(1/e)5 p(1/e) | Bsa (Ons Yn) ke (88) 


The problem now is to relate the observed total slit-plane intensity 
to the peak mode intensities, |Znm(Om, ¢n)|?. The intensity observed 
at the position (6, ¢) is the sum over all possible mode contributions: 


|E5(@, ¢) |? 


_ 7 9 (6 — Om)? _ ( = Qn)? . 
rae a |B nm (Om; Cn) | exp 66?(1/e) | exp | sO (89) 


If |Enm(Om, On) |? is independent of (n, m) over the range where the 
gaussian terms are nonvanishing, eq. (89) can be simplified to give 


P, (K;) 


I 


|E'.(8, 9) |? = |Enm(Om, pn) |?SmSn, (90) 

where S,, and S, are the factored sums 
Sn = pe exp — {L@ — m(Ao/bz) ?/562(1/e) } (91) 
= exp — {Le — m(ro/b,) P/5e*(/e)}. (92) 


Consider the Sm sum expressed in the following dimensionless form 


6 A\_1i PP 
Sm = 2, exp — are ce (3:) 7s | 
= >> exp (—[u — ma), _ 
where 


0 1 Ao. 


= wy)’ 9° = We) ba i 


Although the indicated summation cannot be carried out explicitly, 
it can be expressed in a more useful form via the identity 


= exp [—(u — ma)?] = 2 x exp (- _ 





} cos (2mmu/a), (95) 


which is an immediate corollary of Poisson’s formula.*® Applying this 
identity to the S,, summation yields 


vx v760(1/e) _ m?x*66?(1/e) 2mmd | 
8 = 75g SP | ay es | By | OO 
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For the numerical parameters relevant to the MK VI instrument, 
60(1/e) = 9.6 wrad, (A/bz) = 10 wrad, we need retain only the m = 0 
term in the right-hand side of eq. (96). With an error less than 1074, 
we may take 


Vr50(1/e) | 
~ (A/bz) 


A similar result follows easily for Sn. We then have for the total slit- 
plane intensity 


Sn = (97) 


Vr 60(1/e) Vr 6e(1/e). 
~— (X/bz) (A /By) 


As this result shows, the various elongated gaussian patterns associated 
with the individual K; modes overlap in the slit plane to produce an 
essentially uniform illumination. 

The unknown intensity factors | Hnm(@m, on) |? can now be eliminated 
between eqs. (88) and (98) to give the desired relationship between 
the overall slit-plane intensity and the scattered power per mode, 
namely, 


| Hs(8, 9) |? = |Bam(Om, Gn) |? (98) 


P,(Kj) = |£.(6;, 93) |?(A/bz) (A/by). (99) 


Equation (99) is the basic result which allows the measured intensity 
to be related quantitatively to the amplitudes of the individual 
scattering perturbations.* 

Note that the power actually contained within the angular area of 
a single K; pattern 


| #5(8;, gi) |?60(1/e)6 e(1/e) 
is larger than the scattered power per mode by the factor 
60(1/e) dy(1/e) | 
(A/bz)  (A/by) 


This ratio gives a rough gaussian weighted measure of the number of 
modes that contribute to the intensity reaching a particular (6, ¢). 








3.3.3. The spatial coherence function of the slit-plane field 


In light-scattering experiments designed to extract spectral infor- 
mation from the scattered field using photocurrent correlation tech- 
niques, the feasibility of a particular measurement is critically de- 
pendent on the range of transverse spatial correlation that characterizes 
the observation plane field.?:*27 The extent of the correlation is de- 


* See the discussion which follows eq. (113) and leads to eq. (124). 


1276 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1976 


scribed quantitatively by the normalized mutual coherence function” 


. _ (ELC E, 95 2) Es (é + AE, 9 + An; t)) 
1G AG AD = TRE a IESE FAG a+ An; PD] 


where (& 7) and (& + A&E, » + An) are two arbitrary points in the 
observation plane. The angular brackets denote an appropriate en- 
semble or time average. The function 7T(---) reaches its maximum 
value, 7(---) =1, for A& = An =O and, in general, decreases 
smoothly to zero as A£é and/or Ay increase. The contour in AE and Ay 
around (£, 7) on which the coherence function reaches some specified 
numerical value may be taken as a measure of the area over which 
there is correlated temporal behavior of the two field amplitudes. 

When the main probe beam is derived from a source having perfect 
transverse spatial coherence, as is the case here, then the presence of 
the spatial incoherence in the scattered field is totally attributable to 
the scattering processes taking place in the illuminated volume. The 
spatial coherence properties of the scattered field are uniquely deter- 
mined at the exit face of the sample and are most easily specified 
analytically by calculating the mutual coherence function on a far- 
field reference sphere, 0, centered on the scattering volume. In purely 
formal terms, we can write 


(100) 


_ (E,(r, t)-E;(r + 9, 0) 
Molt, &) = TE, DEG + 0, JI 


where r andr + o both terminate on the surface of the far-field sphere, 
0. Generally speaking, To(r, 9) can be calculated in a straightforward 
fashion once it is assumed that the scattering perturbations satisfy 
certain basic stochastic criteria. 

The relationship between the observation plane coherence function 
T(&, 7; A&, An) and the far-field function T(r, 9) depends, of course, 
on the detailed characteristics of the optical system which collects and 
images the scattered light, and must include the effects of aberrations. 
There are two alternative procedures that may be used to obtain this 
relationship. The first involves the use of the plane wave K; expansion 
of the scattering perturbations that was introduced in the beginning 
of this section. For the MK VI instrument, we have already calculated 
the slit-plane field produced by the scattering from the individual K,. 
In the notation of eq. (89), we have 


E,(£, 7; Kj) 


= (é _ Em)? (n Nn)? 
= Enm(&m; n) EXP | - 2 f5@(1/e) eXD | - 275 e(1/e) | , (102) 


where f is the effective focal length of the light collection system. In 


(101) 
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theory, therefore, we could calculate T'(£, 7; At, An) directly by ex- 
pressing the total slit-plane field as a sum over the E,(é, 7; K;) and 
then performing the statistical average indicated in eq. (100). In the 
absence of aberrations or other imaging defects, this direct method 
represents the simplest approach. For perfect imaging, the individual 
plane-wave scattered field patterns are essentially nonoverlapping at 
the observation plane and the coherence function is effectively domi- 
nated by the contribution of a single K; term. However, when imaging 
errors produce a significant overlap of the E.(£, 7; K,;) at the obser- 
vation plane, as is the case for the present apparatus, then obtaining 
the analytical form of 7'(£, »; Aé, An) by the direct method becomes 
a dificult mathematical problem. 

The second alternative approach involves a direct calculation of 
the far-field coherence function TJ'o(r, 9) from which T'(é, 7; A&, An) 1s 
obtained by using the fundamental laws that govern the ‘‘propagation’’ 
of mutual coherence in an optical system. This latter method is 
generally the more useful when the light-collection system departs 
significantly from ideal imaging. 

For the scattering angles relevant to the MK VI instrument, the 
two-dimensional ‘‘phase sheet’? model of the scattering sample may 
be used to simplify the calculation of To(r, 0). For this two-dimensional 
model object, the reference sphere coherence function is given by the 
van Cittert-Zernike theorem® as 


To(8, 938 — We — ¢') 


2a f [dxdy | Boa, y) |? 
s 


where both r and o have been expressed in the cartesian angular 
coordinates @ and ¢. In eq. (103), the factor | £o(z, y)|? is the ilumi- 
nation function of the object, in our case the ‘‘phase-sheet’’ sample. 
The surface integral is to be taken over the entire (x, y) plane or over 
the open aperture of the object, as appropriate. It should be noted 
that the van Cittert-Zernike theorem will hold as long as the pertur- 
bations in the scattering “phase sheet” have a correlation distance, 
which is short compared to the characteristic spatial dimensions of 
| Ho(x, y)|®. This condition is, in general, well satisfied in the typical 
scattering experiment. 

At small angles, where eq. (103) is valid, the far-field spatial co- 
herence function is independent of the absolute angular position of 
either observation point and depends only on the separations (6 — 6’) 
and (¢ — g’). In terms of these difference variables, To(---) is just 
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the normalized Fourier transform of the source intensity. As such, it 
bears an extremely close resemblance to the instrumental profile 
calculated in Sections 3.1 and 3.2. 

For the MK VI instrument, the illumination function 1s the gaussian 





| Eo(a, y) |? = ER exp (~- = v) (104) 


o 


and we have from eq. (103) 
bx /2 
i exp (—2?/o?) exp [tko(@ — 6’) x |dx 


br /2 
bz /2 
J exp (—2?/o?)dx 


—bz/2 


T(6, 939 -—O,~p— oY) = 


by /2 
[. , oxP (—y?/o*) exp [iko(e — ¢’)y]dy 
x Saba? _______________ (105) 
[mm ex (—v 709 
—by /2 


where b, and by are the aperture dimensions at the scattering sample. 
As is evident from eq. (105), the coherence function factors for the 
case of gaussian illumination and we can write 


T (0, 938 - 6, 9 — ¢’) = To(A8)To(Ag), 


where A@ = 6 — 6’ and Ay = go — yg’. The functions 7 9(A@) and 
To(A¢) are given by the appropriate integrals in eq. (105). Each of 
these integrals is a finite domain Fourier transform of a gaussian kernel 
of the type considered in detail in Section 3.1 with respect to aper- 
ture apodization and vignetting. The only difference is that in eq. (105), 
the “intensity,” exp (—2?/o?), replaces the “‘field,” exp (—2?/207), 
which appeared in the diffraction calculations. It is not hard to show 
that the factored coherence functions 75(A@) and Ty(A¢@) are identical 
to the normalized intensity profiles of Fig. 9 if one uses the corre- 
spondence 





I(y/2) 
[Po = Ty (106) 
Given the form of the far-field reference sphere function 7’, we must 
now determine the relationship between 7’) and the desired slit-plane 
correlation function. 

One of the fundamental results of coherence theory is that second- 
order mutual coherence functions, such as T'(---), propagate according 
to the wave equations as “‘field’” variables. That is, once T(---) 1s 
specified on any surface in an optical system, its form on any other 
surface in the system may be found by treating the coherence function 
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as one would any electric field distribution. Therefore, all of the usual 
wave-diffraction and/or geometrical-optics approaches used to analyze 
wave-front propagation in an optical system are directly applicable to 
the coherence function. | 

For the MK VI apparatus, the coherence ‘‘field”’ described by 7’ 
is identical to the far-field electric field that describes the instrument’s 
directly transmitted probe beam, except for a numerical change in 
the beam-width parameter o. The effective beam width which charac- 
terizes the mutual coherence “‘field,’”’ o7, is related to the actual beam 
width of the instrument, o*, by the result 


2 _ (o*)? 
oT = “9 (107) 


Except for this numerical change, the diffraction and aberration re- 
sults of Sections 3.1 and 3.2 may be used intact to describe the slit- 
plane coherence function. In terms of angular coordinates at the slit 
and the widths 66(1/e) and 6¢(1/e), which were used to characterize 
the instrumental profile, we have easily 


T(6, 9;0-@,¢— ¢) = T'(A0)T’ (Ag) 
7 (0 — 6’)? (e— ¢)? |. 
as |- 4562(1/e) a |- 46 g (1/e) 
(108) 


In the slit plane, as was the case on the surface of the far-field reference 
sphere, the slit-plane coherence functions are related to the intensity 
profile of the transmitted beam by the transformation 


iry)| = Oe. 





Equation (108) is the basic result which may be used to evaluate the 
scattered or stray-light power-per-coherence region or estimate the 
number of coherence regions encompassed by a particular choice of 
main-slit size. For example, given the slit-plane scattered intensity 
|£,(8, »)|2, we can form the weighted integral 


AAO). f [addy |E0, eT’ — #)T'(e — €)F, (109) 


which is a useful measure of the power-per-coherence solid angle as 
measured at the slit.2327 In general, |.(6’, 9’) |? is slowly varying 
over the angular range where [7’(@ — 6’)T’(¢ — ¢’) |? is nonvanishing 
and can be removed from the integral to give 


dP, (6, ¢) 


= |H,(6, ¢) |*A@conA goon, (110) 
dQeon 
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where the mean full-width coherence angles, Adcox and Agcou, are 
defined by the integrals 


Qcon = AbconA vcou 
[rae Pa(as) [LT (Ae) Pa(ae). (111) 


Combining eqs. (108) and (109) gives for the MK VI instrument 


Abcon = V2750(1/e) = 24.1 urad 
Agcon = V2r5~(1/e) = 140.4 prad 


and eq. (110) becomes 


do, @) = |E.(6, y) |2W2780(1/e) V25 o(1/e). (113) 
COH 


(112) 


Earlier in this section, we obtained an expression for | ,(6, ¢) |? based 
on a plane-wave-mode expansion of the scattering perturbations. That 
result may be used in eq. (113) to yield a relationship between the 
observed scattered power-per-coherence solid angle and the scattered 
power-per-K; mode. From eqs. (99) and (113), we find 


dP.(0;, gi) _ V2n80(1/e) | V2m5 p(1/e) 5 pe 
ie py oye ES) 
The product of the correction factors 
(260 (1/e) . 276 0(1/e) 
(A/bz) (A/by) 


is a rough measure of the number of modes that contribute to the 
power observed in a single coherence region at the slit plane, while the 
individual terms indicate the extent of the multiple mode contribution 
in the 6 and ¢ directions. For the MK VI instrument at full aperture, 
the numerical values of the correction factors are 


V2 80(1/e) _ a 

(X/bs) | (115) 
N20 d¢(1/e) — 140 

(A/by) ~ 


The results given in eqs. (99), (113), and (114) together with the 
known form of the instrumental profile may be combined in various 
ways to calculate normalized scattering cross sections from measured 
slit-plane intensities. One important calculation of this type is to 
express the observed stray-light levels in the MK VI apparatus in 
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terms of an equivalent scattering cross section. For the experimental 
profile curves shown in Fig. 4, we may write down an analytical ex- 
pression for the measured stray-light photocurrent, 7(@), as 

=a fo f°" \1.06, 9) |2d0d9, (116) 

—A9g,/2 J—Aggz/2 

where |,.(6, ¢)|? is the stray-light intensity at the slit plane and 
A@sz, and Agsr specify the full-width slit dimensions in angular units. 
The proportionality factor a relates the photocurrent to the optical 
power passed by the slit and includes the detector quantum efficiency, 
light-collection losses, etc. If the intensity |F.:(@, ¢)|? is relatively 
constant over the slit aperture, we have simply 


1(0) = a|£..(0, 0) |?A@grA os, (117) 


where we have assumed that Aggrz is situated symmetrically around 
g = 0. Combining this result with eq. (110) gives the relation between 
the measured photocurrent and the stray-light power-per-coherence 
solid angle as 


aP (6,0) A@sz Agst 
dQcou A@cou Agcou 





1(0) = a (118) 
To eliminate the unknown proportionality constant a, we make use of 
photocurrent observed at 6 = O, the peak of the directly transmitted 
beam. Given the normalized slit-plane intensity profile of the direct 
beam, (6, ¢)/I(0, 0), we can calculate the fraction of the total beam 
power, Po, passed by the slit at 6 = 0 as 


A@sti2 pAgsy/2 T(@, ¢) 








déd 
-nganierd oan O00). a 
= 7, (119) 
lf I(6, ¢) dody 
all Q I(0, 0) 
where for the MK VI apparatus we have 
I(8, ¢) : 





T0,0) “*P | — wave |? [- sate | 


The numerical value of the error function integrals in eq. (119) could 
be obtained from tabulated results for particular values of A@és;, and 
Avgzt; however, in the present case where the slit dimensions satisfy 
the inequalities 


Asi K 66(1/e) 


Avst > 5¢(1/e), ne) 
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we have the more useful analytical result 


Y= — 121 
~~ ¥60(1/e) | (121) 
The measured peak photocurrent, 7(0), is then 
Aésr 
(0) = aP) ———_.. 
eae V750(1/e) (122) 
Dividing eq. (118) by eq. (122) gives the useful result 
i(0) _ 1 dPu(0,0) 1 Ags: 7 
(0) Po dQcon VW iecon (123) 


If desired, the quantity dP,:(0, 0)/dQcoy can be replaced with the 
stray-light power per mode, P;,(K,;), by using eq. (114). This gives the 
very useful relationship 


10) _ ®s(Ky) V2e 86(1/e) Agsr 
i) ~~ @/b.)  O/by) oe 


IV. EMPIRICAL OBSERVATIONS ON THE STRAY-LIGHT BEHAVIOR 
OF OPTICAL ELEMENTS AT VERY SMALL ANGLES 

Very little information of a quantitative nature is available con- 
cerning the imperfection scattering of optical elements at very small 
angles. As a result, the design and testing process leading to the 
present vsA instrument involved a significant amount of trial and error 
evaluation of various optical systems in a search for the desired stray- 
light performance. During this process, a certain amount of empirical 
information was obtained relating to the imperfection-scattering ques- 
tion. This section presents a brief discussion of these observations and 
their influence on the configuration adopted for the MK VI instrument. 


4.1 Reflecting versus refracting optics 


It is clear from a comparison of Figs. 1 and 6 that the implementation 
of a vsA scattering instrument using lenses would be significantly less 
involved than the MK VI off-axis mirror arrangement. The refracting 
system also has the advantage of strictly zero off-axis aberrations 
(coma, astigmatism, and distortion), although a “‘best form”’ single- 
element lens does have eight times the spherical aberration of an 
equivalent spherical mirror.*” In fact, the earliest version of the present 
apparatus utilized precisely the kind of ‘‘straight-through” lens system 
illustrated in Fig. 6. This arrangement was abandoned because of 
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two problems: 


(2) The presence of Newton’s interference fringes crossing the 
illuminated field. 
(iz) An excessive stray-light background. 


The first problem arises because of the partial reflectivity of the two 
lens surfaces and can be solved to some extent through the use of anti- 
reflection (AR) coatings. However, even the best antireflection coated 
lens will form far-field Newton fringes with an integrated inten- 
sity of about + percent of the incident beam power. This fact makes 
the refracting components generally unacceptable in a vSA system. 
The presence of these extraneous reflections and their associated 
interference fringes creates an intense fixed-pattern nuisance back- 
ground which can make it impossible to observe the angular de- 
pendence of the sample scattered light. The stray-light background 
problem is a manifestation of small-angle scattering at the lens which 
may originate from three possible sources: 


(zt) Lens surface ‘‘roughness” or nonconformity (at least two 
surfaces). 

(22) Index of refraction inhomogeneity in the lens bulk material. 

(422) AR coating thickness nonuniformity (at least two surfaces). 


By way of comparison, the possible sources of imperfection scattering 
from a first-surface reflector are 


(7) Mirror surface ‘‘roughness” or nonconformity (one surface). 
(77) Reflective coating(s) thickness nonuniformity. 
(122) Reflective coating reflectivity nonuniformity. 


From a theoretical standpoint, one should be able to evaluate the 
seriousness of each of these defects a priori by calculating the surface 
and/or bulk inhomogeneity scattering. This calculation is straight- 
forward if one has available the spatial form of the roughness in terms 
of the spatial correlation function and the rms roughness amplitude. 
The effect of roughness or inhomogeneity is to impose a spatially 
random-phase perturbation in the optical path. The scattering that 
takes place as a result of this perturbation can be calculated via the 
same ‘‘phase-object’’ approach which is used for the primary scattering 
sample (see Section 2.6). The stray-light intensity observed at some 
specified scattering angles @ and ¢ is given by the Fourier transform 
of the roughness correlation function at a wave vector |K| = 27/A 
satisfying the appropriate small-angle kinematic conditions. Un- 
fortunately, the roughness wavelengths corresponding to the angular 
range of interest here (10-7 em @ A = 1 cm) are determined by a 
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spatial region of the roughness correlation function about which very 
little 1s presently known. This wavelength regime presents difficult 
measurement problems and is generally not probed by conventional 
roughness-testing techniques. The data that is available comes from 
two measurement techniques that tend to flank this regime on the 
short and long wavelength sides: 


(<) The FEco* interferometer and allied methods!” that exhibit 
good surface deviation resolution, 1 A to 10 A, but are useful 
only at short wavelengths (A = 1000 A). 

(772) Conventional “surface-conformity” techniques such as the 
Foucault knife-edge and TWwyman-Green interferometer tests 
that are useful primarily at longer roughness wavelengths 
(0.1 cm to 100 em) and which exhibit relatively poor surface 
deviation resolution (50 A — 2000 A). 


The stray-light measurements that were made during the course of 
the evolution of the present instrument provided the most sensitive 
roughness and inhomogeneity test for this awkward wavelength range. 
It was found experimentally that, for lenses and mirrors of the same 
fraction of the ‘state-of-the-art,’ the stray-light level of a refracting 
instrument was roughly 20 times that of its reflecting counterpart. In 
neither case did the vsa stray-light level correlate well with known 
short wavelength roughness and inhomogeneity data. Both types of 
components exhibited a spatial roughness spectrum that was strongly 
enhanced at long wavelengths. This enhancement did not appear to 
depend as strongly on the ‘‘surface-figure’’ of the component as one 
might be led to expect by qualitative theoretical arguments. 

Comparisons were also made between mirror components having 
multilayer dielectric coatings and those with a conventional Si0- 
protected aluminized surface. The aluminized coatings can suffer from 
a spatially varying reflectivity caused by surface oxidation while high- 
reflectivity dielectric films tend to have a significantly smaller reflec- 
tivity modulation. However, the stray-light measurements showed no 
significant difference between the two types of coatings on similar 
“quality” substrates. Apparently the cumulative roughness of the 
greater number of dielectric layers offsets the dielectric coating’s 
potential advantage. 


4.2 Main scanning slit 


Another major contributor to the stray-light level in earlier versions 
of the MK VI apparatus was the main angle-scanning slit. The slit 
selected for this application is a commercial Spex unit normally used 


* Fringes of Equal Chromatic Order. 
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as an intermediate or exit slit on a double-grating spectrometer known 
for its low stray-light background. This fact not withstanding, severe 
stray-light problems were encountered in predecessors of the MK VI 
that had this slit located directly at the focal plane of the collecting 
mirror (mirror Mg, shown in Fig. 1). The origin of this problem was 
traced to scattering of the direct beam by the beveled surfaces of the 
slit jaws and to quasi-specular reflection from the slightly flattened 
and rounded jaw edges. This source of background by itself was of 
sufficient intensity to completely swamp the sum total of all other 
stray-light sources in the instrument. 

This problem was solved in the MK VI apparatus by occulting the 
directly transmitted beam, before it reached the main scanning slit, 
with a precision knife-edge fabricated of highly attenuating black glass 
plate. The use of glass instead of metal permits the edge defining 
surfaces to be optically polished without cold flaw and rounding. In 
addition, the included angle formed by the edge surfaces is made 
obtuse, rather than the acute angle normally used, to avoid the 
feathering problems and surface irregularity enhancement associated 
with small included angles. The salient geometrical features of the 
knife-edge are illustrated in Fig. 20. The actual occulting edge is formed 
by a single beveling operation on polished flat stock and is oriented in 
use such that an incoming ray strikes the beveled face at the quasi- 
Brewster angle. The beveling angle is chosen so that the ray which is 
refracted into the plate travels parallel to the plate surfaces and 1s 
totally absorbed. 

The improvement in stray-light level obtained by using the knife-edge 
to occult the direct beam, rather than relying solely on the main scan- 
ning slit, can be seen in Figs. 3 and 4. The improvement amounts to 
roughly an order of magnitude over the angular range of interest. 


4.3 Aberration corrections and stray light 


The reader familiar with optical system design will realize that the 
aberrations present in the MK VI instrument could be ‘‘corrected’’ 
using well-known techniques. However, the application of these cor- 
rection methods has two drawbacks: cost and reduced stray-light 
performance. The simplest corrective measures, those which add the 
fewest number of optical elements to the basic apparatus, entail the 
use of off-axis fabricated, aspheric reflecting and/or refracting elements, 
These types of elements are, in general, exceedingly costly to fabricate. 
More sophisticated aberration-corrective designs, utilizing only spheri- 
cal optics, require a larger number of additional elements. In either 
case, of course, the presence of additional optical surfaces means 
degraded stray-light performance. Furthermore, any corrective design 
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Fig. 20—-Geometrical features of the occulting knife-edge used in the MK VI 
apparatus. 


relying on the use of refracting elements will be further penalized by 
the excessive small-angle stray light which these elements generate. 


APPENDIX A 
Finite Slit-Width Effects in the Scanning of Gaussian Intensity Profiles 


When a gaussian focal-plane profile is scanned by a finite-width slit, 
the transmitted power is proportional to the integral 


Eo+A 
T(Eo, A) = fi exp [—#/88(1/e) ds, (125) 


where £9 gives the position of the center of a slit whose width is 2A. 
By writing the spatial coordinate & as 


one eae § (126) 


and making a change of variable, we may put eq, (125) into the form 
J (&o, A) = exp L— &/6#(1/e)] 
A 

x fo exp [— (28 + (°)/82(1/e) Mg. (127) 
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Table IX—Angular displacement, 4), at which an unbroadened 
gaussian and a slit-broadened gaussian reach specified 
fractions of peak intensity 





Ao Ao 
J (80, A) Unbroadened . Slit Broadened A 
J (0, A) Ae = Oprad Ae = 1 prad Increase 
(urad) (urad) 

J 0 0 — 
1/2 7.993 8.022 0.36 
l/e 9.600 9.635 0.36 
107! 14.567 14.620 0.36 
10-2 20.601 20.675 0.36 
10-3 25.231 25.322 . 0.36 
1074 29.135 . 29.239 0.36 
1075 32.573 32.690 0.36 
10-6 35.682 35.810 0.36 
1077 38.541 38.678 0.36 
1078 41.203 41.348 0.36 


For reasonably small values of the ratio [A/6£(1/e) |, the gaussian term 
in the integrand of eq. (127) may be approximated by the leading 
term in its Taylor’s series expansion 


exp [—{?/s#(1/e)] = 1 — eH Poe 


with a maximum error exp [ — A?/6£(1/e) |. Within this approximation, 
the remaining integral can be calculated in a straightforward manner 
to give 


_ - sinh [2¢04/8%(1/e)] | 
Since the function (sinh x)/z tends to unity as x goes to zero, the 
normalized slit-broadened profile is 


J(&, A) _ Apes sinh [2§A/8#(1/e) ] 
Tea) ~ 8? C~ BSE) REE a } 29 


or its equivalent written in terms of the scattering angle 6 = &/f. 
Clearly, in the limit A — 0, eq. (129) describes the correct unbroadened 
gaussian. For A ~ 0, the principal effect of the (sinh x)/z correction 
term is to push up the tails of the profile while leaving the peak of the 
gaussian relatively unaffected. A good quantitative feeling for the 
nature of this correction may be obtained by solving for the off-zero 
displacements, £, at which the broadened and unbroadened profiles 
reach specified fractions of their peak intensity. These & values then 
specify the profile half-widths at the corresponding intensity level. 
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For the curves presented in Section II, the relevant numerical 
parameters, expressed in angular units, are: 


60(1/e) = (1/f)sé(1/e) = 9.6 wrad 
Ay = (1/f)A 1.0 urad. 


Table IX gives the calculated half-width values 0) = (£/f) obtained 
from eq. (129) for various choices of the ratio J(6o, A)/J (0, A). For 
comparison, the table also lists the corresponding half-widths of the 
uncorrected gaussian, and the percentage of line-width increase caused 
by the slit-width correction. As is evident from these results, the 
effect of the (1/z) sinh x correction term is to alter the gaussian profile 
in such a way that the observed half-widths are an essentially constant 
percentage larger than the true values. 


APPENDIX B 


Numerical Evaluation of the Diffraction Profile of Apertured 
Gaussian Illumination 


Equation (47) gives the basic integral for the truncated gaussian 
diffraction profile as 


E(®) = ace [- exp (—22/o2) exp [é(2m/Pdo)év]dr. (130) 


This expression may be put into a form more suited to numerical 
computation as follows. We write the exp 7(---) term as 


exp [7(27/fdo) Ee] = cos Kx + isin Ke 
with 
2rE 270 


k= = —_— 
fro Xo 


(131) 


and note that the sin Kx integral vanishes by symmetry. Next by a 
change of variable 


r= ae (132) 
we obtain 
w=] 
E(é) = UES. cos cw exp (—a?w?)dw, (133) 
(fro)? J w=0 
where a and ¢ are defined as 
_ Kb _ (2r&\6 _ xb6 ia Or 
ce = (RE) o Bo Oe) 


The gaussian in the integrand is now expressed in terms of its Taylor’s 
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series expansion 


2 (—1)"(a"a")* 
> n! 


exp (—a’z?) = 
to give E'(£) as 


by =a)" 


E(&) = fro) = ee fo = w?” cos (cw) dw. (135) 





Equation (135) forms the basis for the numerical computation of the 
profiles. 

A simple closed-function form for the w integrals in eq. (135) does 
not exist; however, recursive relations among these integrals can be 
found from the standard integrals 


1 1 1 
/ x" cos cx dx = 2 — 7 | x”! sin cx dx (136) 
0 C C J0 
and 
1 1 
J 2" gin cr dx = — SO8° ++ (ona!) I x” eos cx dx. (1387) 
0 C C 0 
Defining 


1 
Dim (ce) =a x” cos cx dx, (138) 


we easily obtain the following recursion formulae from eqs. (1386) 
and (137): 


(m + 3)Lmi2(c) = (a= ) sinc + Co as) COS € 


Cc? 
— ee F 8n FP) Fm + 1)Em()] (139) 


(m — 1) Lm—2(c) = (<) sinc + cos c¢ 


— EH Lm + DEm(o)I (140) 


From eq. (136), we also have for m = 0 


Inc 


Lo(e) = 





(141) 


In terms of the Ln(c), the expression for the diffracted field takes the 
series form 





E(t) = bE'o z& 1)”(a?)"Len(c) 


ho) n! _ 
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and the desired normalized intensity profile is 


© (—1)"(a2)"Loa(o) |? 
(8) | = | 


Se ua 143) 
(0) ~ [ = Corey — 
ee 
n=0 nN: 


The zero argument L,,’s can be written down explicitly from eq. 
(138), vez. 





(144) 


For the numerical results reported here, the series in eq. (148) were 
truncated at some n = nmax by testing the value of (1/7!) (a?)"Zen(0) 
and terminating when this quantity was smaller than some chosen 
convergence criterion, «. In the present case, e was set at « = 107”. 
For the largest (b/c) value, (b/c) = 8.33, where the gaussian kernel 
of eq. (183) is 

exp (—a?w?) = exp (—8.68w?), 
49 terms in the series were required for convergence. 


For each individual pair of values for c and nmax, the required string 
of Lm’s are generated by two subroutine programs. 


B.1 Subroutine No. 1,¢c <1 
When the quantity c = (Kb/2) is less than one, the L,,’s are obtained 
by the following procedure. 


(7) Calculate Im(c) for m = 2nmax directly from the defining 
equation (188), using the Taylor expansion for cos cx to write 


= oe 2n-+m 
Line) = ~, ant to W2rtmdw 
es) (—1)"c?” 


x Gayla + 2a +1) (145) 


(77) Truncate the sum in eq. (145) when c?”/(2n)!(m + 2n + 1) is 
less than 107, 

(277) Use this result for Lny,x(c) to obtain the required L,,’s via 
the backward recursion formula, eq. (140). 


B.2 Subroutine No. 2,c>1 


When the quantity c = (Kb/2) is greater than 1, the L,,’s are found 
by a two-part procedure that depends on the value of nmax. 


(t) For m = 2n values for which the inequality m = 2n < ¢ is 
satisfied, use Li(c) = (sin c)/c and the forward recursion rela- 
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tion, eq. (140). If 2nmax is less than c this first step gives 
all required Lm’s. 
(22) If 2nmax is greater than c, set 


COS C 
Lim (ce) RY ——— 


m+il 


for some m >> 2nmax and work backward using recursion relation eq. 
(140). The calculated string of L,,’s is joined onto the forward recursion 
values from step (z) for some mc and then renormalized. 

This rather elaborate procedure for calculating the Lm’s is made 
necessary by the rapid accumulation of numerical round-off errors 
which arise in the repetitive application of the basic recursion formulae. 
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We present an exact description of scattering of an incident plane wave 
with TE-polarization at an interface between two dielectric media that is 
deformed by a grating with triangularly shaped teeth. The theory employs 
an expansion in plane waves outside of the grating region and describes 
the field in the grating region as a double Fourier series expansion. The 
results of this theory are represented graphically. That blazing provides 
substantial discrimination of the scattering process in favor of beams 
scattered into one or the other of the two media is shown. The exact theory 
as used to check an approximation for the effective reflection plane that is 
useful for future applications of the theory to scattering by gratings of 
guided waves tn thin-film waveguides. 


I. INTRODUCTION 


This study of dielectric sawtooth gratings with deep grooves serves 
several purposes. Its principal aim is to investigate a particular analyti- 
cal method for describing deep gratings with the view of applying it 
(at a later time) to waveguide-grating couplers. However, even without 
the added complication of one more dielectric interface that charac- 
terizes the waveguide problem, an examination of the response of 
dielectric gratings with deep grooves to a plane wave, incident at an 
angle that would lead to total internal reflection at the corresponding 
smooth surface, can teach us much about the expected behavior of 
waveguide-grating couplers. 

The literature on the electromagnetic theory of diffraction gratings 
is vast. However, most papers are limited to discussions of metallic 
gratings,!? and only a few papers mention dielectric gratings with 
sawtooth-shaped grooves and plane waves incident at angles larger 
than the critical angle for total internal reflection.? When it comes to 
providing numerical information for a given particular case, each 
worker must write a computer program to solve the problem at hand, 
since no publication can cover all conceivable cases in graphical form. 

1295 


Developing the computer program for the study of deep gratings was 
one of the aims of this work. 

Our method for treating TE-wave interaction with deep dielectric 
sawtooth gratings is basically simple and exact. We express the field 
above and below the grating as a series of plane waves using the periodic- 
ity imposed by the grating. In the grating region, the field is expressed 
as a double Fourier series expansion whose terms are not individually 
solutions of the wave equation. The unknown coefficients entering the 
various serles expansions are determined by the requirement that the 
field in the grating region must be a solution of the wave equation and 
by enforcing the proper boundary conditions along two mathematical 
planes just above and below the grating region. By varying the number 
of terms used in the series expansions, it was found that the series 
converge very well, and good accuracy is obtained with relatively few 
terms. However, the required number of terms increases with increasing 
depth of the sawtooth grating. 

The simple grating problem described here has the advantage that 
only an inhomogeneous equation system needs to be solved. Since the 
problem does not contain unknown eigenvalues, no search for suitable 
eigenvalue conditions is required. The exact solution of the correspond- 
ing waveguide problem would lead to an eigenvalue equation. A very 
large determinant with complex coefficients would have to be forced 
to vanish by proper choice of the propagation constant of the leaky 
wave inside the guide, one of whose interfaces between core and 
cladding is formed by the grating. The simple grating furnishes im- 
portant information about the phase shift suffered by the reflected 
plane waves. This information can be used to estimate the eigenvalues 
of the modes inside of the waveguide with a grating on one of its 
interfaces. This information is useful for finding approximate solutions 
of the waveguide grating problem without the need for solving a 
costly and time-consuming eigenvalue problem. 

For shallow gratings, our theory is in complete agreement with 
perturbation theory. Some of the features of deeper gratings with 
groove depth on the order of the wavelength can be explained by 
geometrical optics coupled with simple grating conditions. The ray 
paths in deep gratings (groove depth larger than the wavelength) are 
so complicated that an explanation of maxima or minima in terms of 
geometrical optics fails. 

The sawtooth-shaped interface deformation is a blazed grating. It 
has the advantage that its shape can be adjusted to enhance certain 
grating orders. In particular, it is possible to let a high grating order 
predominate over lower orders. Furthermore, the grating shape can 
be used to favor scattering into the air space above the grating or, cor- 
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Fig. 1—Triangularly shaped dielectric grating as an interface between the two 
media with index 7 and nz. (This figure defines grating parameters and incident and 
scattered beams.) 


respondingly, to favor scattering down into the higher dielectric region, 
the substrate, from which the incident wave impinged on the grating. 
This preferential scattering behavior is very useful for the construction 
of grating couplers. Additional grating responses, other than those used 
for the coupling beam, decrease the overall efficiency of a grating 
coupler. If unwanted grating lobes can be suppressed by properly 
shaping the grating teeth, higher coupling efficiencies are obtainable. 
Gratings that show strong asymmetry in favor of a certain grating 
order provide also high-reflection losses for the zero-order grating lobe 
(that would correspond to the guided mode field of a waveguide). The 
waveguide mode thus would decay rapidly over a few periods of the 
zig-zag path of the guided ray. This means that high-efficiency grating 
couplers based on this principle would have to be very short. 


1.1 Theory of the dielectric sawtooth grating 

Figure 1 shows the geometry of our sawtooth grating. A ray labeled 
zis incident from the medium with refractive index n, on the dielectric 
interface with the medium nz whose shape is a sequence of sawteeth. 
The specularly reflected beam is labeled 0. Also shown are two scat- 
tered beams labeled 1 which escape into the medium with index ne. 
(subsequently to be called the air space) and into the medium with 
index n; (subsequently to be called the substrate). The grating period 
is D; Dis the distance along the base of each sawtooth from its begin- 
ning to the point underneath its peak. The grating amplitude is defined 
as 2a. 

We consider only TE-waves with the electric field component EF, and 
the magnetic field components? 
tony (1) 


Who OZ 





H, = — 
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and 
ah OEe, 
Wio OX 





A, = (2) 
(w and yp, are, respectively, the angular frequency of the wave and the 
magnetic permeability of vacuum.) The grating is infinitely extended 
in y direction, so that all y-derivatives vanish. The field components 
E,, H,, and H, do not exist. The periodicity of the infinitely extended 
(in z direction) grating forces the electromagnetic field to be of the 
following form: 


Ey = e~ tBiz | Aspen - » Aneintei@xiDIea for v < 0, (3) 


m=— 0 


and 


v2] 
By = ee Do Caen tenet eri a es for x = 2a. (4) 


m=— 0 


Since HL, must satisfy the wave equation 





Ch, , PH, a ae 
I op ee (5) 
with 
k= = = WV Eolio (6) 


(A. = free space wavelength, e«. = dielectric permittivity of vacuum, 
nm = ny, Or Ne refractive index of the dielectric medium), the parameters 
appearing in (3) and (4) must have the form, 


om = | nfl? ~ (8. — 77 m)"] (7) 


Pm = Ez — («. = my | (8) 


A; is the amplitude of the incident wave with propagation constant 


and 


B; = nik cos 0. (9) 


The term propagation constant is used here in the same sense as In a 
waveguide; it 1s actually the zg component of the plane wave propaga- 
tion vector. 

Note that the superposition of plane waves (3) and (4) was chosen 
so that the traveling parts of the wave move away from the grating 
with the exception of the incident wave of amplitude A” [the time 
dependence is understood to be exp (tw?) |. It is clear that only a small 
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number of waves in the field expansions actually propagate in x direc- 
tion, because almost all terms of the form (7) and (8) are imaginary. 
The signs of the imaginary quantities must be chosen, so that the 
evanescent fields decay in the direction away from the grating, 


on = —tlon|, Pm — 4 pat: (10) 


Every term in the expansion (38) and (4) is a solution of the wave 
equation, but a similar expansion cannot be written down for the field 
in the grating region 0 S 2 S 2a. Instead, we simply use a doubly 
infinite Fourier series. 


CO 
E, —_ e tBiz > Byme? tl et Cal D mz 0 < ey < 2a. (11) 


n,m=—0 


Except for the phase factor exp(—78;z), the field solution is periodic 
in z with period D. This periodicity is a very important feature of the 
field solution and is imposed by the periodicity of the grating. The 
function (11) is also periodic in x direction with period length 2b. This 
periodicity is quite arbitrary. It would appear natural to let b = a. 
However, this choice of b would force the field to have exactly the same 
values at « = 0 and x = 2a, which is physically unreasonable. For 
this reason, we must allow b to be arbitrary, but use 6 >a. As a 
practical matter b = v2a has been used for the numerical calculations 
in the hope that this choice would facilitate the convergence of the 
series. Clearly, 6 should not be made too large and, of course, it must 
not be smaller than a. . 

It now remains to determine the expansion coefficients Am, Cm, 
and Bim. This is accomplished by substituting (11) into the wave 
equation (5), multiplying the resulting equation with exp(—imn x/b) 
exp(—72am z/D) and integrating over z from 0 to D and over x from 
0 to 2a. Continuity of the fields at the planes x = 0 and x = 2a re- 
quires us to force HL, and its z-derivative to be continuous at these 
planes. After elimination of A,, and C,,, from the equation systems, we 
are left with the following three infinite simultaneous equations: 


Ds (on + 5”) Brmet@ridna = (12) 
> (» = 57] Big 2 On AO Bue: (13) 
oa) arn’ 9 : 
>: IN anon = Db a Bm! NM yn Oman Barn = 0. (14) 


dmmt 18 Kronecker’s delta symbol. In (12) and (13) m is allowed to be 
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any integer, and, similarly, m and m are allowed to be any integer in 
(14). The first two equations stem from the boundary conditions, 
while (14) expresses the requirement that the field expansion (11) 
satisfy the wave equation (5). The three sets of infinite equations (12) 
through (14) are used to express Bnm in terms of A{?. The coefficients 
N nt—n.m'—m and M,,n are listed in the Appendix; 6, is defined as 


2 
Bn = Bi — Fm. (15) 


The amplitude coefficients A, and C, are obtained in terms of Bum as 
follows: 


I 


A. ( > Bam) — AP ine (16) 


5 Bametlom+ (x1) n]2a (17) 


n=—0 


Cm 


The power of the incident wave flowing through an element of unit 
area parallel to the x direction is given as 





To i 
BSS oa | As? |?. (18) 
It is convenient to express the power carried away by the scattered 
beams in terms of the power of the incident beam. For the grating 
orders carrying power into the air space, we obtain the relative power 
from 
AP ie _ Pm 1Cm |? 
PP. 6, [AM]? 


Similarly, we obtain the relative power carried into the substrate, 


AP ns om |Am|? 
“P,P ia 








(19) 


Let us close this section with a few remarks about the numcrical 
solution of the equation systems (12) through (14). As mentioned 
above, the terms in the series expansions (3) and (4) represent traveling 
as well as evanescent waves. It is clear that all terms corresponding to 
traveling waves must be included in the truncated series expansions 
used for approximate numerical solutions of the problem. According 
to (7), propagating grating orders are associated with m values in the 
interval 

E (B; — mk) | <m< | (nik + 6) | (21) 


The label ‘‘int’’ is a reminder that the integer, whose absolute value is 
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just smaller than the value inside of the bracket, must be taken. Those 
terms in the series expansion (3) whose m-values lie outside the interval 
(21) belong to evanescent waves that do not carry away power. As 
a practical matter, we found that sufficient accuracy is obtained if just 
one—or at most a few—evanescent waves on each side of the interval 
(21) are included in the series expansions. The terms in the expansion 
(11) cannot be interpreted as traveling or evanescent waves. The sum 
over ™m is, of course, intimately related to the m-summations in (3) 
and (4), and an equal number of terms must be taken in all m-summa- 
tions. We found that the n-summation in (11) converges more slowly, 
so that usually more terms are required in this series. In all numerical 
calculations whose discussions follow, we never used more than 11 
terms in the n-summation, and often as few as 7 terms proved to be 
sufficient, if the grating amplitude remained below 2a/\, = 0.5. The 
total number of unknowns Brym in the equation system (12) through 
(14) is, of course, the product of the number of terms in both series 
expansions, n and m. For large values of 2a, for example for 2a/\, = 2, 
we used 66 unknowns Brym, and for 2a/A, < 0.5, 36 unknowns seemed 
to be sufficient. 

The fact that the equations stemming from the boundary conditions 
(12) and (13) must be included in the equation system to be solved 
prevents us from using an equal number of terms in the n’, m’ summa- 
tions of (14) and for the ‘‘free”’ indices n and m. Obviously, the number 
of m-values [the number of equations of the type (14) | that are used 
must be two less than the number of terms under the m-summation 
sign. 


1.2 Geometrical optics considerations 

Figure 1 shows the principal function of the diffraction grating. The 
incident plane wave breaks up into several components after striking 
the dielectric interface. The strongest wave leaving the grating region 
is usually the zero-order grating response that leaves in a direction 
corresponding to the specularly reflected beam at an ideal, smooth 
interface. Throughout this discussion, we assume that the incident 
.wave strikes the interface at an angle 6, that remains below the critical 
angle for total internal reflection at the unperturbed, smooth boundary. 
In addition to the incident and specularly reflected plane waves, a 
discrete number of scattered plane waves are generated. These waves 
emerge in directions that are defined by the condition that all scattered 
waves interfere constructively. The condition for such constructive 
interference is expressed by the relation 


2 
Bm = Bi — 5 Mm. (22) 
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The 6; and 8, are the z-components of the propagation vectors of the 
incident and scattered waves; m is a positive or negative integer. The 
angles @,, of scattered waves in medium 1 are obtained from (9) and 
(22) as follows, 


a Bm 
Om == arccos (= | . (23) 


and the corresponding scattering angles in medium 2 are 


Bn 
dm = arccos ( aa ): (24) 
The integer m defines the grating orders of the scattered beams. The 
direction of the specularly scattered plane-wave is obtained by using 
m = 0, and m = +1 gives the first grating orders, etc. 

The intensities of the scattered beams decrease with increasing 
grating order, if the grating amplitude 2a is much smaller than the 
wavelength. However, for deep gratings whose amplitude is comparable 
to, or larger than, the wavelength, higher grating orders may well 
predominate over lower grating orders. In particular, it is possible 
to predict maxima of scattered waves based on geometric optics con- 
siderations. Such maxima occur when the direction of a grating lobe 
defined by (23) or (24) coincides with the condition of specular reflec- 
tion of the incident beam on one of the facets of the grating teeth. 

Consider the situation shown in Fig. 2. Geometrical optics allow us 
to calculate the angle 6,. of the reflected wave as 


Bn = 2a -+- 6.. (25) 


If 6, simultaneously satisfies (23), a strong grating response may be 


) nr 





Fig. 2—Specular reflection from grating faces can be used to explain maxima of 
the grating lobes. 
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Fig. 3—Grating lobes in air can be enhanced by specular reflection from the grating 
teeth as shown. 


expected. From (23) and (25), we find the following condition for a, 





bol 


le B; 

arccos{ °° OD arceos ( - ) (26) 
—_—____— nik 

nik 


The grating angle a, is defined in terms of the other grating parameters 
as: 


2a 
a = arctan (55): (27) 
Maxima for grating responses into the air space can occur in many 
different ways. One possibility is depicted in Fig. 3. The geometric 
optics condition for 6, is computed in several steps. The refracted 
angle 6, follows from Snell’s law, 


6, = arccos (= COS (a2 + i) — Qe, (28) 
2 
and the angle of the mth grating response in air follows from 


— 6. (29) 


| ; 2a 
om = 2a, — 0, = 2 arctan (=) 


The conditions that must be satisfied by the grating parameters to 
achieve equality of (24) and (29) can be found by an iterative 
calculation. 

Geometric optics conditions leading to maxima of the grating re- 
sponse in air can be complicated in many ways. For example, the ray 
escaping into the air space shown in Fig. 3 may be intercepted by the 
grating tooth through which it just passed and may suffer further 
refraction. Another possibility is depicted in Fig. 4. The fact that many 
geometric optics conditions exist that may enhance the grating re- 
sponse in air makes it difficult to account for the maxima of the air 
lobes of deep gratings. 
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Fig. 4—The incident ray is bent back into the waveguide by successive refraction 
at the grating teeth. Scattering takes place at every tooth and may be enhanced if 
the specular reflection condition is satisfied. 


Figure 4 not only shows how a ray may escape through the grating 
following a complicated path, but it also indicates that the unscattered 
portion of the ray is diffracted back into medium 1 following a path 
that takes it inside of the grating. This geometric optics picture sug- 
gests that the effective penetration depth of the reflected light field 
can be estimated by geometric optics methods. For this purpose, we 
assume that the grating acts on the refracted ray as a graded-index 
medium with an index distribution 


Wa) = = Tah + = 8 (30) 





We can now use the WKB method to determine the phase of the wave 
that penetrates into the grating region. It is well-known that the wave 
penetrates into the graded-index medium until it reaches the turning 
point of ray optics at x = ¢. The phase of the reflected wave taken at 
the reference plane x = 0 is given by® 


; Sac% T 
_— aCe Bed a ee sn 
6=2 f ViA@)RF- Bide -—F= ae 5 (GW) 
We define an effective reference plane by assuming that the medium 
with index n, reaches into the grating region to a depth « = dapy. The 
phase of a wave reflected at this reference plane (that is assumed to 
consist of the index discontinuity from 7; to ne) 18 


@ = 2odapp — 2 arctan : (32) 


with 

y = (67 — ngk*)*. (33) 
The first term in (32) accounts for the phase shift caused by the round 
trip from « = 0 to x = dayp, and the second term is the additional 


phase shift on reflection from the index discontinuity.’ By equating 
(31) to (32), we obtain the following expression for the depth of the 
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effective reference surface inside of the grating: 


_ 4ac? 1 far Y 
dapp 3(n? — 2) ae (J arctan x | : (34) 
If the tangent of the phase angle y of the reflected wave is known, for 
example, from the numerical solution of the grating problem, the 
effective reference plane can be calculated from the expression 


2 arctan i —-ywotpr 

ay (35) 
Note that the approximation (34) holds only for gratings that are 
sufficiently thick so that the turning point of the rays is located deep 
enough inside the grating, so that the evanescent field beyond the 
turning point has decayed to insignificant values by the time it reaches 
the top of the grating. Furthermore, (34) is certain to represent the 
effective reflection plane better as the grating period is short. Numerical 
comparisons of the two expressions (34) and (35) will be presented in 
the next section. 


1.3 Examples and numerical evaluation 


The boundary between the two media with index n, and nz 1s 
described by the function 
2a 
= <p< 
Dy 0 = Dy, 
f() = (36) 


2a 
_ ne < < 
D p, P Z) D,s2=D 


} 


which is periodic in z with period D. Its Fourier coefficients are: 


aD2e—tm (Di/D) . Dy, 
Cm = immDi(D — D1) sin (am D ) : (37) 
This Fourier coefficient is important, because for the first order of 
perturbation theory!” the grating responses are proportional to |¢m|?. 
In the remainder of this section we present the results of numerical 
evaluations of our theory in graphical form. Figure 5a shows the 
relative power that is scattered into the air space above the grating. 
The incident plane wave always arrives at an angle that is small enough 
(measured with respect to the plane interface) to ensure total internal 
reflection at the smooth interface between the two media. In Figs. 5 
through 9, we use Bij, = 8.5. The grating period was chosen as 
D = 1.3\, resulting in three grating lobes labeled m = 1, 2, and 3. 
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2a/\,= 0.1 D/Ap= 1.3 B,Ag=8.5 


2a/hg= 0.1 D/Ag= 13 fP,AQ=8.5 


n,=1.5 ny = 1.0 n,=1.5 ny = 1.0 





0 0.2 0.4 0.6 0.8 





Fig. 5—Relative scattered power for the three grating lobes for D/A, = 1.3, 
Bito = 8.5, and nm, = 1.5, nz = 1.0, and for a grating amplitude of 2a = 0.1,.. 
(a) Shows the grating responses in air as functions of the grating shape factor D./D. 
(b) Shows the grating lobes in the substrate (index 71). 


The refractive index of the medium below the grating (called the 
substrate) 1s ny = 1.5, and the medium above the grating is assumed 
to be vacuum (or air) with ne = 1.0. The grating amplitude in Fig. 5a 
and b is 2a = 0.1\,. This grating amplitude is already too large for 
perturbation theory to be accurate, but the zeros (or minima) and 
maxima of the grating responses can still be identified with the help 
of (37). Consider, for example, the second-order grating lobe with 
m = 2, First-order perturbation theory predicts that it has zero power 
at D,/D = 0.5. Figure 5a for the grating responses in air shows that 
the zero of the second-order grating lobe is indeed very close to this 
value. The corresponding minimum (the power does not actually go 
to zero) for the substrate beam with m = 2 is, according to Fig. 5b, 
located at D,/D = 0.4. Its position is shifted from the value that 
perturbation theory would predict, but the reason for the occurence 
of this minimum. is still clearly discernible. The zeros for the third- 
order grating lobe, m = 3, would be located at Di/D = 3 and 
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D;/D = 3, if perturbation theory would apply. Correspondingly, 
Fig. 5a and b show that the zeros of the third-order grating responses 
are indeed close to these values. 

Another interesting relationship results if we compare the power 
carried by the beams at Di/D = 1. According to perturbation theory, 
we should find that the power ratio between the grating lobes m = 1 
and m = 2 is 4, while the ratio between the lobes with m = 1 and 
m = 3 should be 9. According to Fig. 5a these power ratios are 4 and 
8.8, respectively. The substrate beams shown in Fig. 5b give ratios of 
3.8 and 1.8, respectively. The third-order grating response in the 
substrate is thus already considerably larger than perturbation theory 
would predict. | 

Figure 6a and b prove that all resemblance to perturbation theory 
is lost, if we increase the grating amplitude to 2a = 0.5\,. According 
to perturbation theory, an increase of the grating amplitude by a 
factor of 5 should increase the scattered power by a factor of 25. No 


10° 


10—! 


APA 





—2 
p, 10 
—3 
10 2a/A,=0.5 BjAg=8.5 D/Ag=1.3 2a/Np= 0.5 BiAp=8.5 D/Ag= 1.3 

n,=1.5 ny=1.0 n,=1.5 n,=1.0 

AIR BEAMS SUBSTRATE BEAMS 
10~4 

0 0.2 0.4 0.6 0.8 1.0 0 0.2 0.4 0.6 0.8 1.0 
D,/D 


Fig. 6—Similar to Fig. 5a and b, except that the grating amplitude is 2a = 0.5). 
(Arrows indicate the position of scattering enhancement by specular reflection from 
grating teeth.) 
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2a/Ng= 2.0 D/dg= 1.3 PB Ag=8.5 2a/Ag=2.0  D/Ayg= 1.3 BArAQ=8.5 


n,=15 ny = 1.0 ny=1.5 =1.0 
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AIR BEAMS SUBSTRATE BEAMS 





0 0.2 0.4 0.6 0.8 1.0 0 0.2 0.4 0.6 0.8 1.0 
D,/D 


Fig. 7—Similar to Fig. 5a and b with 2a = 2.0X.. 


such increase is apparent for the first-grating order, nor is it indeed 
possible since the power in the scattered beams cannot exceed the 
input power. However, it is now possible to identify certain features 
of the curves by using geometrical optics. The arrows in Fig. 6a indicate 
the position where maxima of the grating lobes would be expected 
because of a coincidence of the direction of the grating lobes with 
specular reflection from the grating faces. The position of the arrows 
in Fig. 6a was computed from (24), (27), (28), and (29). Even though 
the agreement is not perfect, there is a strong indication that the maxima 
of the grating responses are indeed caused by specular reflection at the | 
grating faces. The position of the arrows in Fig. 6b was computed from 
(23), (26), and (27). For the substrate lobes, the condition of specular 
reflection from the grating faces agrees very well with the actually 
observed grating maxima. 

These figures show, furthermore, that very good discrimination be- 
tween different grating responses can be obtained by a blazed grating. 
Consider a grating with D,/D = 1. The first-order grating lobe in air 
carries 0.12 relative power while the corresponding substrate beam 
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carries only 0.02 relative power. The power of the higher-order grating 
lobes is less than one-third of the power in the first-order grating lobes. 
This observation has important consequences for grating couplers with 
blazed gratings, since loss of power to unwanted grating lobes can 
clearly be minimized. We shall see that even better results are obtain- 
able with gratings that have only first-order grating lobes. 

Finally, we let the grating amplitude grow to 2a = 2), and show in 
Fig. 7a and b how the third-order grating lobe now dominates the 
grating response. The maxima and minima of these curves cannot 
easily be identified by ray tracing because of the many possible ray 
paths. However, the maximum at D,/D = 1 of the curve with m = 3 
in Fig. 7a seems to be caused by the specular reflection indicated in 
Fig. 4. The accuracy of the curves in Fig. 7a and © is not as high as that 
of the other figures. Whereas 42 simultaneous equations were sufficient 
to solve the problem with sufficient accuracy for 2a = 0.5\., 66 simul- 
taneous equations were used to produce Fig. 7a and b. Computing 
time increases with the third power of the equation number. The ac- 
curacy of the curves in Fig. 6a and b is better than 10 percent, but the 
accuracy of the curves in Fig. 7a and b is poorer. However, these 
curves are certainly correct to order of magnitude and have the cor- 
rect shapes. 


2a/Ay=0.5 D/Aj=05 B,AQ=8.5 





0 0.2 0.4 0.6 0.8 1.0 
D,/D 
Fig. 8—Grating with only first order lobes. D/\. = 0.5, Bito = 8.5, m1 = 1.5, 


ne = 1.0, and 2a/A. = 0.5. The dotted line represents the relative scattered power in 
air; the solid lines represent the power in the substrate. 
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To gain insight in the beneficial effects of blazed gratings, a grating 
with only first-order lobes, D/A, = 0.5, Bit. = 8.5, and an amplitude 
of 2a = 0.5, was investigated. The results are plotted in Fig. 8. It is 
apparent that power is scattered predominantly into the substrate 
(solid line), if Di/D is small and predominantly into air (dotted line) 
if D,/D approaches unity. Figure 9 shows the ratio of air-to-substrate 
beam power for D,/D = 1 as a function of the grating amplitude 2a. 
Also shown in this figure is the power-reflection coefficient of the 
specularly reflected component; that is, the zero-order beam in the 
substrate. As the grating becomes deeper, the power discrimination 
between air and substrate beams becomes better, but the power re- 
flection coefficient of the specular-beam component becomes lower. If 
we apply this situation to waveguide geometry, the incident plane wave 
and the reflected wave with m = 0 would both correspond to the guided 


el 


D/Ag=05 BA,=8.5 
D,/D=1.0 





0 0.1 Oi? * «4 0.3 0.4 0.5 0.6 
2a Ag 


Fig. 9—Ratio of the powers AP,/AP, that are scattered into air and substrate and 
also the power reflection coefficient R of the zero-order beam in the substrate as 
functions of the normalized eranne depth 2a/r.. It is D/X. = 0.5, Bido = 8.5, 
D,/D = 1.0, ny = 1.5, and nz = 1.0. 
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0 0.1 0.2 0.3 0.4 0.5 0.6 
2a /Xo 


Fig. 10—Normalized scattered power into air (dotted line) and substrate (solid 
lines) for a grating with 2 = 3.5, n. = 1.0, D/A. = 0.26, Di/D = 1.0, and Bir. 
= 19.83. The arrows indicate the position of points where specular reflection from the 
grating teeth coincides with the grating condition. 


mode. The power loss on reflection expresses the mode attenuation per 
“bounce.” A grating with good power discrimination between air and 
substrate beams suffers very high scattering losses. 

Figures 10 and 11 complete our investigation of the scattering 
properties of blazed gratings with large-grating amplitudes. These 
figures apply to a substrate with high-refractive index, m1 = 3.5. The 
gratings have D,/D = 1 in Fig. 10 and D,/D = 0 in Fig. 11. In both 
figures we used D/\, = 0.26 and B,;A. = 19.83. These figures show the 
scattered power as functions of the grating amplitude. It is obvious 
that the scattering levels off with increasing grating amplitude, so 
that it does not help to increase the grating depth beyond a certain 
point. However, the discrimination between air and substrate beams 1s 
affected by the grating depth. The arrows indicate points where specular 
reflection from the grating faces should enhance the scattered power. 
Except for obvious interference effects by some other ray path, 1t seems 
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SUBSTRATE BEAM 
I 


D,/D=0 fjAg=19.83 D/Ag= 0.26 


n,=3.5 n=1.0 





0 0.1 0.2 0.3 0.4 0.5 0.6 
2a / rN 0 


Fig. 11—Similar to Fig. 10 with Di/D = 0. 


that the maxima tend to be located where geometrical optics would 
predict. 

We have extracted information about the effective reflection plane 
of the incident and specularly reflected plane waves. The theory of the 
effective reflection plane was presented in eqs. (34) and (85). Figure 
12 shows the position d (normalized with respect to \.) of the effective 
reflection plane measured from the lower edge of the grating at x = 0. 
We assume that the phase of the zero-order beam in the substrate can 
be accounted for by reflection from an effective plane interface of the 
two media with index n; and nz located at x = d. The solid lines in 
Figs. 12 and 13 are obtained from our exact theory. The dotted curves 
represent the results of applying the WKB approximation to a con- 
tinuous refractive index distribution as explained in connection with 
eq. (34). Figure 12 applies to a long grating period of D/A, = 1.3 and 
ni = 1.5, while Fig. 13 was drawn for D/\, = 0.26 and m = 3.5. For 
large grating amplitudes the agreement with the approximate theory 
is apparently better for shorter grating periods. It might be expected 
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that the approximation would become very good for D/A — 0. Our 
use of the WKB approximation becomes inapplicable to 2a— 0. In 
this limit the effective reflection plane is better approximated by d = a. 
However, it is clear that the WKB approximation provides a useful 
estimate of the position of the effective reflection plane for deep 
gratings. This information is very important for an application of our 
theory to an approximate description of scattering by gratings on 
dielectric film waveguides. 

The final figure, Fig. 14, shows a comparison between first-order 
perturbation theory and the exact grating theory. This figure repre- 
sents the relative scattered powers in the first-order grating lobe (the 
only lobe that propagates in this case) as a function of the grating 
amplitude 2a for D/d, = 0.5, Bit. = 8.5, ny = 1.5, and ne = 1.0. It is 
interesting to observe that the air beam is actually stronger than first- 
order perturbation theory would predict, while the substrate beam is 
considerably weaker. It is furthermore of interest that the relative 
strength of air to substrate-scattered power is predicted in reverse 
order by perturbation theory for large grating amplitudes. Whereas 
perturbation theory predicts that more power is scattered into the 
substrate than into air, the exact theory predicts just the opposite. At 


D/dg= 1.3 BAg=85 


n,=1.5 Ny = 1.0 





0 0.2 0.4 0.6 0.8 1.0 12 1.4 1.6 1.8 2.0 
2a/Xpo 


Fig. 12—Position d of the effective reflection plane as a function of grating ampli- 
tude 2a for D/d\. = 1.8, Bitn = 8.5, m1 = 1.5, and nz = 1.0. The effective reflection 
plane is practically independent of D:. The solid line is obtained from the exact 
theory while the dotted line was computed from the wKB approximation. 
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n,=3.5 ny =1.0 





0 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 
2a /Xo 


Fig. 18—Similar to Fig. 12 with D/A. = 0.26, BiA. = 19.83, m1 = 3.5, and n, = 1.0. 


this point our theory is at variance with claims made by Tamir’ whose 
results are in qualitative agreement with perturbation theory but dis- 
agree with our theory. 

The arrow in Fig. 14 indicates the point at which the specular reflec- 
tion condition from the faces of the grating teeth is satisfied for the 
substrate beam as shown in Fig. 2. This point is in good agreement with 
the maximum predicted by the exact theory. Perfect agreement cannot 
be expected for such small grating amplitudes and short grating 
periods, because geometrical optics cannot be expected to hold under 
these conditions. 

Figure 14 shows that for this type of grating first-order perturbation 
theory is reasonably accurate for grating amplitudes below 2a = 0.05),. 


Il. CONCLUSION 


We have found that our exact treatment of deep dielectric gratings 
with triangularly shaped teeth provides a satisfactory method for 
computing the scattering problem. Our theory has only been applied 
to incident plane waves of TE polarization. The field outside of the 
grating region was expanded in a series of plane waves, while the field 
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in the grating region was expressed as a double Fourier series expansion. 
Numerical evaluation of this scattering theory requires a modest 
computational effort. The required number of simultaneous equations 
that must be solved increases with increasing grating amplitude. The 
computer time increases with the third power of the number of equa- 
tions used. For gratings with an amplitude of 2a/\, = 0.1, 28 simul- 
taneous equations were used, 42 equations were necessary for 2a/A 
=0.5, and 66 equations were used for 2a/A, = 2.0; however, somewhat 
greater accuracy seems desirable for accurate results in this latter case. 
We found that blazed dielectric gratings are able to provide good 
discrimination of one grating lobe at the expense of other grating re- 
sponses. The position of maxima and minima of the grating lobes as 
functions of the grating shape can be accounted for by perturbation 
theory for small grating amplitudes and by geometrical optics for 
larger grating amplitudes. However, multiple ray paths make the 
geometrical optics interpretation difficult for very deep gratings. 
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PERTURBATION THEORY 
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107% D/A,=0.5 B,A,=85 D,/D=1.0 
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Fig. 14—Comparison with first-order perturbation theory. D/A, = 0.5, Di/D = 1, 
Bido = 8.5, m1 = 1.5, and nz = 1. 
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It is possible to define an effective reflection plane for the zero-order 
reflected grating lobe. This concept is useful for an approximate de- 
scription of grating scattering of guided modes in thin dielectric films. 
Once the effective width of the film is known for the guided modes, 
scattering losses can approximately be calculated by accounting for 
the scattered power by means of a theory that is essentially no more 
complicated than the theory presented here. We have shown that the 
position of the effective reflection plane can be estimated by means of 
the WKB approximation. 
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APPENDIX 
We list here the coefficients that enter in the equation system (14). 


ve ee ee 
_ (ri — n3)K% J See Gx a ) 
SoG A) 1 — exp | 22x | = (m! — m) +5 (n! — n) 
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Models for the Subjective Effects of Loss, 
Noise, and Talker Echo on Telephone 
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Tests have been conducted at Bell Laboratories within the last 10 years 
to obtain subjective evaluations of the effects of loss, noise, and talker echo 
on telephone transmission quality. We use these subjective test results to 
formulate graphical and analytical models of subjective opinion that can 
be used in network planning studies to evaluate transmission performance 
of the network and to study the effects of network changes on performance. 
These models are based on the concept of a generalized transmission- 
rating scale. Separate opinion curves for each test take into account differ- 
ences caused by factors such as subject group, type of test, and range of 
condtitons. We also describe the methods of data analysis used in the 
formulation of the transmisston-rating scale and opinion models, provide 
a comparison of the test results with the models, and discuss the models in 
sufficient detail to permit their application in transmission planning 
studies. 


I. INTRODUCTION 


In the last 10 years, several tests have been conducted at Bell 
Laboratories to determine the subjective evaluation of loss, noise, and 
talker echo in telephone connections. The purpose of these tests was 
to obtain information for use in network planning studies. Several 
hundred volunteers served as subjects and participated in several 
thousand test calls with various amounts of loss, noise, and talker 
echo. Some of these tests were conducted on normal business calls 
made by Bell Laboratories employees; others were conducted in a 
laboratory environment. At the end of each call, the subject was asked 
to indicate his or her opinion of the transmission quality on a five- 
category rating scale: excellent, good, fair, poor, and unsatisfactory. 

The results from these tests were used to formulate the graphical 
and analytical models of opinion which are presented in this paper. 
These models have been used extensively in network planning studies 
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to evaluate present network performance and to study the effect of 
possible changes in the network. Although many of these studies are 
not yet published, a recent paper by Spang! provides an excellent 
example of the application of these models in toll transmission planning. 

For many years the Bell System has used subjective tests to obtain 
information on the effects of transmission quality which could be used 
in network planning studies. For example, the present Via Net Loss 
design plan, which introduces loss to control talker echo in the Direct 
Distance Dialing (ppp) network, is based on the results of subjective 
tests for talker echo that are included in a 1953 paper by Huntley.? 
Subsequent talker-echo tests were described by Phillips in 1954.3 
Coolidge and Reier reported the results of tests of received telephone 
speech volume in 1959 and introduced the concept of volume grade-of- 
service.* Test results for message circuit noise were used in noise 
grade-of-service studies described by Lewinski in 1964.5 

Most of the tests mentioned considered the effect of one transmission 
parameter at a time. Since transmission parameters appear in com- 
binations and there are, In many instances, Important interactions, a 
new series of conversational tests for the combined effects of connection 
loss and circuit noise was initiated in 1965. Subjective evaluations were 
obtained on normal business calls within Bell Laboratories. The results 
of these tests were reported by Sen in 1971.° Since then, the use of 
combined loss-noise grade-of-service based on these tests has largely 
replaced the use of the earlier noise and volume grade-of-service. 

Another test to determine subjective reaction to loss and noise was 
conducted in 1969 on normal business calls within the Bell Laboratories 
location at Holmdel, New Jersey. There were two reasons for the new 
test. One was to obtain a larger number of subjects per test condition 
and thus reduce the experimental variability. The other reason was to 
include both symmetric loss conditions, as was done in the 1965 tests, 
and asymmetric loss conditions; i.e., unequal loss in the two directions 
of transmission. The test results for the symmetric conditions indicated 
a more critical assessment of quality than the 1965 tests, which could 
not be explained by known differences in the tests. Therefore, a third 
test was planned and conducted in 1972. 

During this same period of time, new talker-echo tests were initiated. 
These echo tests used the same five-category rating scale as the loss- 
noise tests so that possible tradeoffs between loss-noise and echo grade- 
of-service could be studied. Some of the echo tests were conducted in 
the laboratory. Others were conducted on normal business conversa- 
tions between Bell Laboratories employees, where values of loss, noise, 
echo-path loss, and echo-path delay could all be controlled. 

As the results of the various tests became available, work was con- 
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tinued to modify and improve earlier loss-noise opinion models and to 
develop similar talker-echo opinion models. Systematic methods were 
formulated to analyze the results from individual tests and to combine 
the results from different tests into a composite loss-noise-echo opinion 
model. In addition, a transmission-rating scale was introduced that 
assigned a single numerical value to any specific combination of trans- 
mission conditions. 

The concept of a generalized transmission-rating scale recognized 
that subjective test results can be affected by various factors such as 
the subject group, the type of test, and the range of conditions that 
are included in the test. These factors were found to cause changes in 
both the mean opinion score for a given condition and in the standard 
deviation.* Thus, there were difficulties in trying to establish a unique 
relationship between a given transmission condition and subjective 
opinion in terms of mean opinion score or other subjective measures 
of transmission quality. The introduction of a transmission-rating 
scale tended to reduce this difficulty by separating the relationship 
between transmission characteristics and opinion ratings into two 
parts. For the first part, the transmission rating as a function of the 
transmission characteristic, was anchored for two specific transmission 
conditions and thus tended to be much less dependent on individual 
tests. The second part, the relationship between transmission rating 
and subjective opinion ratings, could then be displayed for the in- 
dividual test. 


The essential features of the transmission-rating scale and opinion 
models are summarized in Section II of this paper in enough detail 
to permit their application in transmission planning studies. The re- 
mainder of the paper describes three subjective tests for connection 
loudness loss and circuit noise and four tests for talker echo, outlines 
the methods of analysis, and describes the formulation of the composite 
transmission-rating scale and opinion models. Comparisons of the 
individual test results and the final model are also presented. 


Il. SUMMARY OF TRANSMISSION-RATING AND OPINION MODELS 


The models for transmission rating described in this section are 
based on the results of seven subjective tests. All of the tests were 
conducted with Western Electric 500-type telephone sets.7 Loudness 


* Results of subjective tests in terms of the number of votes in each of the several 
categories of a rating scale can be expressed in a number of ways. One way is to 
assign numerical values to each of the categories, e.g., excellent = 5, good = 4, 
fair = 3, poor = 2, and unsatisfactory = 1. Each of these numerics is then weighted 
by the proportion of votes in the corresponding category for a particular transmission 
condition, and the weighted values summed. The result is ne the mean opinion 
score for that transmission condition. 
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Table |—Summary of tests 


Loss-Noise Tests Talker Echo Tests 
SIBYL SIBYL SIBYL Lab Lah Lab SIBYL 
1965(MH) | 1969(HO1) | 1972(HO2) 1966 1968 1970 1970 
Number of subjects 66 78 74 29 30 100 45 
Number of conditions 24 15 12 30 93 10 16 
Number of ratings 685 1163 1684 870 2790 1000 752 
Median ratings/ 
condition 29 60 60 29 30 100 38 
Connection loudness 
loss (dB) 5-30 10-30 5-30 * * 18 10 
Circuit noise (dBrnC) 21-44 22-45 25-42 28 18-38 33 30 
Echo-path loudness 
loss (dB) — — — 10-59 | 0-50 33-73 6-42 
Echo-path delay (ms) — — — 20-90 | 1.5-90 | 600, 1200 | 10-72 
Sidetone-path loudness 
loss (dB) 13 12 12 9 12 12 12 
Average room noise 
[dB (A)] 45 42 42 35 35 38 42 


* These tests were not, strictly speaking, two-way conversation tests and, thus, connection loudness- 
loss values are not appropriate. 


loss values used in the model describe the acoustic-to-acoustic transfer 
efficiency of overall telephone connections and are expressed in terms 
of the Electro-Acoustic Rating System (Ears) method.® Noise values 
used in the model are expressed at the line terminals of a telephone set 
with a reference receiving efficiency of 26 dB based on the Ears 
method. * 

The major aspects of these tests are summarized in Table I. A more 
detailed description of the tests is presented in Sections III and IV. 

The results of the subjective tests were used to derive transmission- 
rating models for (2) loss and noise, (77) talker echo, and (277) the com- 
bined effects of loss, noise, and talker echo. In addition, models were 
derived for the relationship between transmission rating and subjective 
opinion. 

The procedures used in the analysis of the subjective-test results 
and the derivation of the transmission-rating scale are described in 
Sections V and VI. Although the procedures are somewhat complex 
for manual calculation, they are easily handled on a digital computer 
and have been found to provide a convenient and useful representation 
for a large variety of test data. 

Mathematical expressions for the models are summarized in Table 
II. The derivations of these expressions are also given in Sections V 


“The several subjective tests, results of which were used in deriving the model, 
were conducted with different circuit-noise values and with telephone sets operating 
at different Ears receiving efficiencies. To enable combination of results from the 
different tests to be made, it was necessary to express all circuit-noise values in terms 
of a telephone set with reference receiving sensitivity. The value of 26 dB was chosen 
because it is approximately the receiving efficiency of a customer loop consisting of a 
Western Electric 500-type telephone set,’ a short-line facility, and a standard central 
office feeding bridge. Noise values are given in dBrnC.°:!° 
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and VI. The remainder of the present section provides a general 
description and graphical presentation of the models. 


2.1 Connection loudness loss and circuit-noise model 


Transmission rating as a function of connection loudness loss and 
message-circult noise is shown in Fig. 1. The curves were plotted using 


Table Il— Models for estimating subjective reaction to loss, 
noise, and echo 


The models, in terms of a transmission-rating scale, for loss and noise (Rzy), 
echo (Rz), and loss, noise, and echo (Rryz) are: 


Riv = 147.76 — 2.257V (ZL. — 7.2)? + 1 — 2.009Nr + 0.02087(L.)Nr (1) 
Re = 95.01 — 53.45 logio{ (1 + D)/V1 + (D/480)?} + 2.277E (2) 


Rive = EAE aif Sex =e T+ coy, (3) 


where LZ, = Acoustic-to-acoustic loudness loss (in dB) of an overall telephone connec- 
a eober med using the Electro-Acoustic Rating System (EARS) 
method, 
N = Circuit noise (in dBrnC) at the input to a set with a receiving-loudness 
rating of 26 dB, determined using the Ears method, 
Nr = Total noise in dBrnC resulting from power addition of the circuit noise, 
N, and 27.37, both in dBrnC, 
D = Round-trip echo-path delay (in milliseconds), and 
EK = Acoustic-to-acoustic loudness loss (in dB) of the echo path, determined 
using the EARS method. 


The proportion of comments good or better (GoB) or poor or worse (PoW) are com- 


puted from FR by: 
1 fA 
GoB = — i: e~Pi2dt 
V2 oo 


1 t+) 
Pow = — [ oP lagt 
V2n JB 


where A and B are given in the table below for the various data bases. 


Data Base A B 
MH (R — 64.07)/17.57 (R — 51.87)/17.57 
HO1 (R — 77.44)/17.07 (R — 60.70)/17.07 
HO2 (R — 73.74)/15.68 (R — 58.08)/15.68 
Echol (R — 75.05)/14.30 (R — 58.95)/14.30 
Echo2 (R — 66.66)/11.84 (R — 53.33)/11.84 


The parameters A and B have been derived from opinion distributions, in terms of 
fit mean, yw, and fit standard deviation, a, and then expressed as a function of R&R. 
Alternatively, the models can be expressed as follows: 


pun = (R — 21.37)/12.20 og = 1.44 

bHo1 = (umn + 0.206)/1.8372 = (R — 18.86)/16.74 o, = 1.02 

Hoo = (uma + 0.215)/1.288 = (R — 18.75)/15.71  o, = 0.998 
bE. = (R — 18.7)/16.1 gq = 0.888 
Bee = (R — 20)/13.33 oq = 0.888 
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Fig. 1—Transmission rating for loss and noise. 


eq. (1) below, which is also given in Table II. 


Riv —_ 147.76 = 2.257 (L, at (i2)" + 1 = 2.009N p 
+ 0.02037(L.)Nr, (1) 
where 


DL, = Acoustic-to-acoustic loudness loss (in dB) of an overall 
telephone connection 
N = Circuit noise (in dBrnC) at the input to a set with a receiving 
loudness rating of 26 dB 
Nr = Power addition of the circuit noise, N, and 27.37 dBrnC. 


The transmission-rating scale was derived, so that 1t is anchored at 
two points, as shown in Table III. These anchor points were selected 
to be well separated in quality, but within the range of conditions 
that are likely to be included in a test. Transmission ratings for other 
combinations of connection loudness loss and circuit noise are relative 
to those for the two anchor points. The rating values are such that 
most telephone connections will have positive ratings between 40 and 
100, with the higher rating denoting higher quality. For most engineer- 
ing applications sufficient accuracy can be achieved by the use of 
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Table IIl— Anchor conditions for the transmission-rating scale 


Connection Loudness 


Loss Circuit Noise Transmission 
(dB) (dBrnC) Rating 

15 25 80 

30 40 40 


whole numbers on the transmission-rating scale. For example, in the 
1965 loss and noise tests, conditions with a transmission rating of 
approximately 80 were considered good or excellent by 80 percent of 
the subjects, while a transmission rating of 40 was considered good or 
excellent by only 10 percent. 


2.2 Talker-echo model 


Transmission rating as a function of talker-echo path loss and delay 
is shown in Fig. 2. The curves were plotted using eq. (2) below, which 
is also given in Table II. This equation was derived to exclude the 
effects of circuit noise and connection loudness loss. 


Ry = 95.01 — 53.45 logio {{1 + D]/vV1 + (D/480)?} + 2.277E, (2) 
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Fig. 2—Transmission rating for talker echo. 


TELEPHONE CONNECTION QUALITY = 1325 


where 


D = echo-path delay (in ms) and 
E = acoustic-to-acoustic loudness loss (in dB) of the echo path. 


The curves of Fig. 2 demonstrate the dependence of transmission 
quality on the two talker-echo path parameters, loss and delay, for 
connections where talker echo is important. 


2.3 Connection loudness loss, circuit noise, and talker-echo model 


Transmission ratings for the combined effects of connection loudness 
loss, circuit noise, echo-path loudness loss and echo-path delay are 
obtained from eq. (3) below, which is also given in Table II. 


—— 2 
Rive = Xt Ae _ (A) + (10)?, (3) 


where 


Rivz = transmission rating for the combined effects of connection 
loudness loss, circuit noise, and talker echo, 
Riy = transmission rating for connection loudness loss and circuit 
noise 
Rz = transmission rating for echo-path loudness loss and delay. 
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Fig. 3—Transmission rating for loss, noise, and talker echo. 
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Figure 3 illustrates curves generated by means of the above relation- 
ship for the transmission rating as a function of echo-path loudness 
loss and delay in a connection with a connection loudness loss of 15 dB 
and circuit noise of 30 dBrnC. For other values of connection loudness 
loss and circuit noise, the curves would become asymptotic to higher or 
lower values of & in accordance with the curves of Fig. 1. 


2.4 Subjective-opinion models 


Subjective opinion in terms of the proportion of ratings in each of 
the five categories (E, G, F, P, U) for a condition having a given 
transmission rating has been found to depend on various factors, such 
as the subject group, the range of conditions presented in a test, the 
year in which the test was conducted, and whether the test was con- 
ducted on conversations in a laboratory environment or on normal 
telephone calls. For the major tests on which the transmission-rating 
model is based, the observed relationship between subjective judgments 
and transmission rating can be represented as shown in Figs. 4 and 5 
which are plotted from the equations for GoB and PoW of Table II. 
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Fig. 4—Subjective opinion as a function of transmission rating for the connection 
loudness loss and circuit-noise tests. 
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Fig. 5—Subjective opinion as a function of transmission rating for the echo tests. 


Contours of constant percent good or better and percent poor or 
worse are given in Figs. 6 and 7, respectively, for the loss-noise results. 
These contours were computed from eq. (1) and the equations for 
GoB and PoW of Table II using the MH data base. Similar contours 
could be generated based on one of the other tests. However, the 
Murray Hill base is being used for current network planning studies 
for consistency with earlier studies. In addition, the opinion results 
from this test appear to be in close agreement with data obtained from 
customer interviews on typical ppp toll connections. Thus, at the time 
of publication, the Murray Hill base is recommended for conversion 
of the transmission ratings to subjective ratings. Eventually, we hope 
to determine other expressions for A and B (see Table II) that will 
provide even better agreement with customer interviews on varlous 
types of telephone connections. 


2.5 Use of the models 


The models summarized in preceding sections can be used to esti- 
mate transmission quality for telephone connections. The examples 
given below are based on representative 500-type telephone sets con- 
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nected by specified lengths of 26-gauge nonloaded cable to the local 
Class 5 office.! The loss for such an arrangement is expressed 1n terms of : 


TLR = Transmitting loop rating (in dB), which describes the loud- 
ness conversion efficiency in terms of an acoustic signal 
applied at the telephone set and the resulting electric voltage 
at the local Class 5 office. 

RLR = Receiving loop rating (in dB), which describes the loudness- 
conversion efficiency in terms of an electric circuit voltage 
applied at the local Class 5 office and the resulting acoustic 
pressure received at the telephone set.® 


The connection loudness loss, £,, for a local call is the sum of TLR and 
RLR. For calls between customers served by different local Class 5 
offices, L, is the sum (TLR + RLR) plus the 1000-Hz loss between the 
offices. 

The circuit noise, N (in dBrnC), as used in the model is in terms 
of a reference telephone set that has an ruR of 26 dB. Noise levels, as 
typically expressed at the line terminals of a telephone set, need to 
be corrected to the reference set. For the 500-type telephone set, the 
conversion factor 1s about 4 dB (increase in noise) and is nearly in- 
dependent of loop length. 

The effect of talker echo depends on the characteristics of the echo 
path. Generally, the dominating path is that from the talking customer 
to the distant Class 5 office and return, and is referred to as the far- 
end echo path." The round-trip delay of this path, D, is taken to be 
the time required for a speech signal to go from the originating Class 5 
office to the distant Class 5 office and return. (Delay in the customer’s 
telephone set and loop is neglected as, with present plant, it is usually 
Insignificant.) The loudness loss of the echo path, EH, is the sum of 
TLR, RLR, and the echo-path loss from the originating Class 5 office to 
the distant Class 5 office and return. 

Considerations of the preceding paragraphs provide the basis for 
several examples demonstrating use of the models. Details concerning 
computation for these examples are given in Appendix A. Results of 
these examples are summarized in Table IV. The examples are simpli- 
fied representations of connections devised to illustrate application of 
the models and, thus, the results only approximately describe the 
performance of actual connections. (Methods of obtaining more accu- 
rate connection representations are covered in Ref. 1.) 

Comparison of the results for Examples 1, 2, and 3 show that about 
optimum performance for local connections occurs for medium loops. 
The performance is below optimum for short and long loops, the former 
because the loss is lower, the latter because the loss is higher (see 
Figs. 1, 6, and 7). 
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Table I1V— Examples of performance estimates obtained 
using the models 


Le N D E GoB | PoW 
Example (dB) (dBrnC) (ms) (dB) Rin Re RLNE (%) (%) 

1. Local connection, 
short loops 2.7 27 — — 78.3 — — 79.2 6.6 

2. Local connection, 
medium loops 8.4 27 — — 88.7 — — 92 1.8 

3. Local connection, 
long loops 16.9 27 — — 75.5 _— — 74.2 8.9 

4, Long-toll connection, 


medium loops, no 

talker echo 16.1 32.5 — — 71 — — 65.2 | 13.9 
. Long-toll connection, 

medium loops, with 

talker echo 16.1 32.5 37.38 | 31.7 | 71 82.6 | 65.2 | 52.6 | 22.4 


or 


Comparison of Examples 3 and 4 illustrates the effect of higher loss 
and noise typically encountered on toll connections.” Finally, compari- 
son of Examples 4 and 5 indicates the effect of talker echo, demon- 
strating the need for echo control. (See Ref. 1 for detailed discussion 
of loss, noise, and talker echo for toll connections.) 


It. DESCRIPTION OF LOSS-NOISE SUBJECTIVE TESTS 


Three tests have been conducted using a special test facility called 
SIBYL to determine subjective reaction to loss and noise on telephone 
connections. This facility allowed control of transmission parameters 
during normal business calls of cooperating Bell Laboratories 
employees.#—15 

SIBYL was first used at the Murray Hill Bell Laboratories location, 
and was moved to the Holmdel location in 1966. 

The test subjects for the s1syL studies were Bell Laboratories em- 
ployees. Prior to the beginning of each of the tests, a list of employees 
reflecting a preselected makeup of age, sex, etc., was obtained, and the 
employees contacted to solicit their participation in the test. Upon 
obtaining agreement, their telephone lines were routed through siByYL 
which could handle up to 100 subject lines. 

An overall connection with srspyL inserted is shown on Fig. 8. At 
the left is a subject’s (participating employee’s) telephone set. In 
close physical proximity (less than about 1500 feet of two-conductor 
cable) is sIByL which converts the two-wire transmission path into a 
four-wire transmission path and separates signals transmitted from 
the subject’s telephone set and signals received at the same telephone 
set. Separation of the signal paths in this manner permits (7) inserting 
different impairment values for the two directions of transmission and 
(22) independent measurement of signals transmitted from and received 
at the subject’s telephone set. 

Proceeding from left to right in Fig. 8, the four-wire path is re- 
converted to a two-wire path in srByL, and connected to the serving 
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Fig. 8—Diagram of a telephone connection for the sIBYL tests. 


central office over a two-wire cable pair which is about 1 mile in length 
for Murray Hill and about 3 miles in length for Holmdel. The central 
office switching machine connects the subject’s line to the called line 
dialed by the subject. The called line (approximate lengths as given 
above for Murray Hill and Holmdel) is terminated with the telephone 
set of the called party, another Bell Laboratories employee at the 
same location who is usually not a subject in the experiment. 

SIYBL recognized calls internal to the location, and only such calls 
were included in the experiments reported here. The major reason for 
this was to retain as much control as possible of the transmission 
parameters on test calls. Employees’ telephone lines at any one location 
generally were of about the same physical makeup. Thus, variations 
in transmission parameter values between lines were small, and the 
values were considered to be identical for all lines. srpyL then altered 
the normal parameter values to achieve the parameter values of interest 
In an experiment. 

Restricting the calls to within the location had the further advantage 
that the subjects physical environment was reasonably uniform, con- 
sisting largely of offices with one to four desks and of electronic labora- 
tories. Thus, room noise levels, which could affect the transmission 
quality of telephone calls, did not vary appreciably. 

The subjects were provided with instructions at the beginning of the 
test, and thereafter, the procedure during a call was generally as shown 
in Fig. 9. (The procedure of Fig. 9 applies specifically to the Murray 
Hill test.) The subject initiated the procedure by lifting his handset 
from the telephone set cradle and dialing a number. If the call was 
not a test call, which was the case for about 85 percent of the calls 
from the subject group, the call was completed normally. That is, 
(z) the subject received a signal indicating the called line was busy 
and returned his handset to the cradle, (72) the called party did not 
answer and the subject returned his handset to the cradle, or (722) 
the called party answered and they conducted their conversation, 
after which the subject and called party returned their respective 
handsets to the cradles. 
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If the call was a test call and the called line was not busy, a test 
condition (e.g., a predetermined combination of loss and noise values) 
was inserted at the beginning of the call, and the conversation pro- 
ceeded. If, during the call, the test condition was unacceptable to the 
subject (e.g., the noise was too loud) the subject could dial a digit 
(e.g., a five) on his telephone set dial. This signaled sriByL to remove 
the test condition, after which the call completed normally. (This 
occurred on only about | to 2 percent of all test calls.) 

In the more usual case, the subject was either not consciously aware 
of the test condition or did not find it unacceptable and completed the 
conversation, then returned the handset to the cradle. A short time 
thereafter, typically seconds, the subject received ringback—the 
telephone set ringer emitted a burst of sound—which alerted the 
subject to rate the transmission quality of the call just completed. 


3.1 1965 Murray Hill SIBYL test 


In 1965, a test to determine subjective reaction to loss and noise 
was conducted using the srByu facility, which was then located at Bell 
Laboratories, Murray Hill, New Jersey. The configuration of a typical 
connection incorporating SIBYL is shown on Fig. 8 as discussed earlier. 


TEST SUBJECT DIALS CALL 


NOT A TEST CALL TEST CALL 
CALL COMPLETES TEST CONDITION 
NORMALLY APPLIED 
CALL PROCEEDS 
NORMALLY 


IF CONDITION 
IS UNACCEPTABLE, TEST SUBJECT 
SUBJECT DIALS 5 DOES NOT DIAL 5 


TEST CONDITION CALL COMPLETES 
REMOVED NORMALLY 

CALL COMPLETES SUBJECT GETS 
NORMALLY RINGBACK 


SUBJECT RATES 
CONDITION 


_ Fig. 9—Procedure for a sIBYL call. 
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Table V—1965 Murray Hill SiByL test of loss and noise: 
number of ratings for each test condition 


Connection Circuit Noise (dBrnC) 
Loudness Loss 

(dB) 21 28 36 AA 

5 35 12 24 20 

10 61 36 34 34 

15 33 33 a] 28 

20 29 43 23 35 

25 35 32 29 24 

30 12 12 23 11 


The 24 combinations of loss and noise values that were tested are 
given in Table V together with the number of subjective ratings ob- 
tained for each combination. The loss values of Table V represent 
acoustic-to-acoustic loudness loss (in dB) of the connections, and are 
numerically equivalent to the electrical losses in terms of which results 
were published earlier for this test.° Circuit noise levels (in dBrnC) 
were originally reported in terms of (2) added noise, and did not include 
allowance for noise normally present on the connections, and (22) the 
average receiving sensitivity of Western Electric 500-type telephone 
sets at the Murray Hill location. The noise levels of Table V represent 
total noise from all sources and are expressed at the line terminals of a 
telephone set with reference receiving sensitivity. The condition with 
loudness loss = 5 dB and noise = 21 dBrnC represents average trans- 
mission normally experienced by employees on calls within the Murray 
Hill location. 

The lines connecting employees’ telephone sets to the central office 
were all of about the same length. Thus, the loudness loss of the 
telephone side tone path,!*!” strongly dependent on line length, was 
expected to be about the same for all employees. Side tone path loud- 
ness loss was measured on a sample of lines and telephone sets, and 
found to be about 13 dB. Circuitry was incorporated into SIBYL to 
ensure that the side tone path loudness loss would also be about 13 dB 
for subjects’ telephone sets when connected to SIBYL. 

Room noise was measured at a sample of subject locations using a 
sound-level meter with A-weighting.!® The average value was found to 
be 45 dB(A) with a range of +2 dB. 

Sixty-six employees (subjects) participated in the test. The pre- 
determined combinations of loss and noise were randomly introduced 
into about 15 percent of all within-location, normal-business calls 
placed by the subject group during the 3-month test interval. A subject 
had no prior information that any particular call was being placed 
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over a test connection. The call procedure summarized in Fig. 9 was 
followed for each call. At the end of each test call, a subject judged 
overall transmission quality by dialing a 9 for excellent, 8 for good, 7 
for facr, and 6 for poor. Connections rejected during the call were 
considered to be unsatisfactory. 

Test results are given in Table VI for each test condition in terms of 
the percent of subjects’ votes in each of the five rating categories. These 
results are used in Sections V and VI in deriving the subjective-opinion 
models. 


3.2 1969 Holmdel 71 SIBYL test 

As discussed earlier, a second test to determine subjective reaction 
to loss and noise was conducted in 1969. This test utilized the srBYL 
facility which was moved to the Holmdel location of Bell Laboratories 
in 1966. A major reason for these newer tests was to examine the effect 
of asymmetric transmission conditions on subjects’ ratings. However, 
the test included 15 symmetric conditions (out of a total of 47 test 
conditions) and these are considered in this paper. 


Table VI—1965 Murray Hill SipYL test of loss and noise: 
test results 


Circuit Cannechion Percent of Subjects’ Votes in Each Category 
Noise Loudness Loss |——--- SS 
(dBrnC) (dB) Excel. Good Fair Poor Unsat. 
21 5 60.0 11.4 14.3 14.3 0 
10 63.9 26.2 3.3 6.6 0 
15 66.7 21.2 12.1 0 0 
20 41.3 27.6 24.1 3.5 3.5 
25 34.3 14.3 31.4 8.6 11.4 
30 8.3 25 33.3 16.7 16.7 
28 5 50.0 33.3 16.7 0 0 
10 66.7 30.5 0 2.8 0 
15 54.5 30.3 9.1 6.1 0 
20 25.6 30.2 18.6 16.3 9.3 
25 25.0 15.6 37.5 12.5 9.4 
30 8.3 8.3 33.3 16.8 33.3 
36 . 5 54.2 20.8 16.7 8.3 0 
10 64.7 23.6 5.9 2.9 2.9 
15 33.3 29.6 22.2 14.9 0 
20 21.7 8.7 34.8 21.7 13.1 
25 13.8 3.5 17.2 34.5 31.0 
30 0 13.0 26.1 34.8 26.1 
44 5 15.0 30.0 25.0 20.0 10.0 
10 17.6 26.5 11.8 20.6 23.5 
15 17.8 25.0 28.6 14.3 14.3 
20 8.6 bf 20.0 40.0 25.7 
25 0 4.2 25.0 50.0 20.8 
30 0 0 0 36.4 63.6 
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Table VII—1969 Holmdel 7 SIBYL test of loss and noise: 
number of ratings for each test condition 


Connection Circuit Noise (dBrnC) 
Loudness Loss 
(dB) 29 35 40 45 
10 335 48 41 38 
15 93 — ee = 
20 65 57 77 77 
25 33 34 47 
30 76 61 — $l 


The symmetric combinations of loss and noise values that were 
tested are given in Table VII together with the number of subjective 
ratings obtained for each condition. The condition with connection 
loudness loss = 10 dB and circuit noise = 25 dBrnC was the reference 
condition that was repeated frequently during the test. 

Side tone path loudness loss was measured on a sample of lines and 
telephone sets, and found to average about 12 dB with a range of 
+2 dB both for normal connections and siByL connections. Room 
noise, measured at 12 subject locations, averaged about 42 dB(A) 
with a range of about +2 dB. 

Seventy-eight employees (subjects) participated in this test. Sub- 
jects followed the same procedure during the 2$ month test interval 
as already described for the Murray Hill test. Comparison of the 
Holmdel 1 test results of Table VIII with the Murray Hill test results 
of Table VI shows that in the former, subjects gave lower ratings 
for approximately equivalent combinations of loss and noise than is 
the case for the latter. The major differences between the two tests 
were (z) the subject groups, (22) the time difference of about 4 years, 
and (777) the location. While it is not clear how these differences con- 
tributed to differences in results, the comparison suggests that sub- 
jects’ expectations were higher in the later Holmdel tests. 


3.3 1972 Holmdel 2 SIBYL test 


A third test to determine subjective reaction to loss and noise was 
conducted in 1972. This test also utilized the srpyL facility at the 
Holmdel location of Bell Laboratories. 

A major purpose of the test was to determine whether or not the 
more critical subjective evaluations found in the Holmdel 1 test would 
continue to hold. The test included combinations of crosstalk, loss, 
and noise as well as combinations of loss and noise. Only the latter are 
considered here because only the results for these conditions are used 
in the subjective-opinion model for loss and noise. 
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Table VillL—1969 Holmdel 1 SsiBYL test of loss and noise: 
test results 


Percent of Subjects’ Votes in Each Category 


Circuit Connection 
Noise Loudness Loss = |——_—__——— 
(dBrnC) (dB) Excel. Good Fair Poor Unsat. 
Ze 10 | 37.9 41,2 13.7 4.5 27 
15 29.0 40.9 va Ms 5.4 a2 
20 15.4 26.1 43.1 1a 7.7 
25 3.0 15.1 97:3 273 27.3 
30 2.6 13.2 34.2 30.3 19.7 
35 10 12.5 20.8 39.6 25.0 2.1 
20 1.8 10.5 45.6 26.3 15.8 
25 0 2.9 20.6 53.0 23.5 
30 1.6 3.3 16.4 47.5 31.2 
40 10 4.9 34.1 24.4 26.8 9.8 
20 0 1.3 14.3 53.2 31.2 
25 91 0 12.8 51.1 34.0 
45 10 0 7.9 21.0 47.4 O27 
20 1.3 0 20-1 39.0 37.6 
30 0 0 1.2 0.0 65.5 


The 12 symmetric combinations of loss and noise values tested are 
given in Table LX together with the number of subjective ratings 
obtained for each combination. These conditions covered about the 
same range as for the Holmdel 1 (HO1) sry. test, except that a 
condition of lower connection loudness loss of 5 dB was included in 
the Holmdel 2 (HO2) srpyu test to match the range of loss in the 
Murray Hill (MH) test. 

Room-noise levels and side tone path loudness loss values were 
assumed to be identical with those for the HO1 test. 

Seventy-four employees (subjects) participated in the test. (None 
of these employees had been subjects in the HO1 test.) The subjects 
followed the same procedure as has already been described for the MH 


Table IX—1972 Holmdel 2 SIBYL test of loss and noise: 
number of ratings for each test condition 


Circuit Noise (dBrnC) 


Connection 
Loudness Loss 
(dB) 25 32 42 
5 99 70 53 
10 1029 58 97 
20 50 64 46 
30 62 52 4 
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Table X—1972 Holmdel 2 SIBYL test of loss and noise: 
test results 


Circiit Connechon Percent of Subjects’ Votes in Each Category 
Noise Loudness Loss § |—------  —__——€“ 
(dBrnC) (dB) Excel. Good Fair Poor Unsat. 

25 5 49.5 35.4 13.1 0 2.0 

10 47.2 40.6 10.2 1.3 0.7 

20 14.0 26.0 40.0 18.0 2.0 

30 3.2 3.2 32.3 35.5 25.8 

32 5 27.1 35.7 31.4 5.8 0 

10 19.0 39.6 32.8 6.9 1.7 

20 4.7 6.3 48.3 34.4 6.3 

30 1.9 7.7 15.4 51.9 23.1 

42 5 11.3 11.3 35.8 34.0 7.6 

10 17.5 17.5 33.1 24.7 (e- 

20 4.3 10.9 19.6 41.3 23.9 

30 0 0 25.0 0 75.0 


and HOI tests, except that the voting procedure was changed. In 
the HO2 test, subjects were instructed that at the end of each experi- 
mental call, when they received ringback, they should rate the overall 
transmission quality of the call by dialing 9 for excellent, 8 for good, 
7 for fair, 6 for poor, and 5 for unsatisfactory. In addition, they could 
reject an unacceptable connection during a call by dialing 4. In the 
latter case, they still received ringback after the call and were asked 
to rate the quality according to the 5-point scale. (In the MH and HOI 
tests, a 4-point scale was used for post-call rating, and the fifth point; 
unsatisfactory, was assumed for dialed-out calls.) 

Results of the HO2 test are given in Table X. These results are in 
close agreement with those of the HOI1 test. Comparison of the test 
results for the three loss-noise tests are dealt with further in later 
sections covering derivation of the subjective-opinion models. 


IV. DESCRIPTION OF TALKER-ECHO SUBJECTIVE TESTS 


Talker echo occurs on a telephone connection when a portion of the 
primary speech signal is reflected at an impedance mismatch at some 
point in the connection, and returned to the talker delayed in time. 
The returned signal, talker echo, is defined in terms of echo-path 
delay and echo-path loudness loss. The echo-path delay that occurs 
because of the finite propagation velocity of the speech signal over 
transmission facilities and equipments is the time it takes the speech 
signal to traverse the path from the talker’s lips to the point of im- 
pedance mismatch, then back to the talker’s ear. Echo-path loudness 
loss represents the amount by which the talker’s speech signal is 
attenuated when traversing the same path. 
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Four tests were conducted to determine subjective reaction to 
talker echo. Three of these tests—identified as the 1966 Laboratory 
Echo Test, the 1968 Laboratory Echo Test, and the 1970 Laboratory 
Echo Test—were conducted under laboratory conditions where the 
experimenter could closely control conditions. The laboratories used 
in these tests included rooms that were acoustically designed to muffle 
both internal and external noise.'® The fourth test, identified as the 
1970 stnyL Echo Test, was conducted using the sipyt facility. 

The 1966 and 1968 Laboratory Echo Tests and the 1970 sIBYL 
Echo Tests were designed to study subjective reaction at short echo 
delays (<100 ms) such as might be encountered on long terrestrial 
connections. The 1970 Laboratory Echo Test considered the effects of 
long delays that might be encountered on connections using one- and 
two-hop synchronous-orbit satellite connections. 


4.1 1966 laboratory echo test 


The test conducted in 1966 to determine subjective reaction to 
talker echo utilized the test system shown in block diagram form on 
Fig. 10. This system provided (2) a fixed sidetone path with a loudness 
loss of about 9 dB, (zz) an echo path by means of which the subject 
heard his own voice delayed in time and attenuated under control of 
the experimenter, and (777) a transmission path from the test adminis- 
trator to the subject that had a loudness loss of 14 dB. (Transmission 
from the subject to the test administrator was obtained by means of 
an intercom system.) The administrator and subject were located in 
separate rooms for which the ambient room noise was about 35 dB(A), 
presumably sufficiently low so as to not affect subjects’ ratings. 
Circuit noise was held constant at 28 dBrnC. 

The subject was first given four practice conditions to illustrate the 
range of transmission quality. Then the actual test conditions were 
presented. The test incorporated 30 conditions, five different values of 
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Fig. 10—Test system for the 1966 and 1968 laboratory echo tests. 
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echo-path delay each with six different values of echo-path loudness 
loss. These conditions were presented in random order and each 
subject evaluated each condition once. 

At the beginning of each test session, a subject was seated in the 
test room and given general instructions for the test by the adminis- 
trator speaking over the system of Fig. 10. The administrator’s talking 
level was held constant. For each of the selected test conditions, the 
administrator spoke phonetically balanced sentences with the subject 
Tepeating each sentence immediately afterward. This continued until 
the subject arrived at a rating (excellent, good, fair, poor, un- 
satisfactory) for the test condition. Then the next condition was 
administered. 

The test conditions and the results obtained for the 29-member 
subject group are given in Table XI. These results show that for any 
given echo-path delay, the transmission quality improves with in- 
creasing echo-path loudness loss. Also, the data indicate that for any 
given echo-path loudness loss, the transmission quality is degraded 
with increasing echo-path delay. 


4.2 1968 laboratory echo test 


This test, conducted in 1968, was designed on the basis of results 
obtained in the 1966 Laboratory Echo Test. The test system was the 
same as discussed in the preceding section. 

Side tone path loudness loss was constant at 12 dB. Ambient room 
noise was about 35 dB(A). : 

The test incorporated 93 conditions. Three of these were base con- 
ditions at three different circuit-noise levels. The remaining 90 condi- 
tions represented various combinations of circuit-noise level, echo-path 
delay, and echo-path loudness loss. These conditions were arranged in 
random order and presented to each test subject in two sessions to 
avoid subject fatigue. 

The procedure for each subject followed that discussed in Section 
4,2 except that only three practice conditions were used. 

The test conditions and the results obtained for the 30-member 
subject group are given in Table XII. These test results are reported 
in part 1 of Annex 4 to Question 6/XII in Ref. 19. 


4.3 1970 laboratory echo test 


Tests to determine subjective reaction to echo on circuits with echo- 
path delays of 0 ms, 65 ms, 600 ms, and 1200 ms were conducted in 
1970. The tests consisted of a total of 25 conditions, many of which 
included echo suppressors. One condition at 0 ms, five conditions at 
600 ms, and four conditions at 1200 ms did not employ echo sup- 
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Table Xl—1966 laboratory echo-test results 


Echo Path Percent of Subjects’ Votes in Each Category 
Ta Houenee) Loss Excel. Good Fair Poor Unsat 
20 10 3.5 3.5 41.3 37.9 13.8 
15 0 10.3 48.3 37.9 3.5 

20 6.9 27.6 51.7 13.8 0 

25 41.4 48.3 10.3 0 0 

30 44.8 48.3 6.9 0 0 

35 51.6 44.9 0 3.5 0 
36 16.5 3.5 3.5 24.0 55.2 13.8 
21.5 3.5 20.7 20.7 51.6 3.5 

26.5 10.3 58.6 27.6 3.5 0 

31.5 20.7 58.6 17.2 3.5 0 

36.9 41.3 51.7 3.0 3.0 0 

41.5 62.0 31.0 3.5 3.5 0 
56 26 3.5 3.5 55.1 34.4 3.5 

31 10.3 17.3 51.7 20.7 0 

36 10.3 34.5 51.7 3.5 0 

Al 24.1 48.3 27.6 0 0 
46 51.7 41.3 3.0 0 3.0 
51 55.2 41.3 0 0 3.5 

72 29 3.5 6.9 37.9 51.7 0 

34 0 31.0 51.7 17.2 0 

39 0 48.3 48.2 3.5 0 

44. 20.7 51.7 27.6 0 0 

49 41.4 37.9 20.7 0 0 

54 44.7 48.3 3.5 3.5 0 
90 34 0 10.3 48.3 37.9 3.5 
39 6.9 31.0 48.3 10.3 3.5 

44. 10.3 48.3 41.4 0 0 

49 24.1 58.7 10.3 6.9 0 

54 41.4 44.8 13.8 0 0 

59 55.2 41.3 3.5 0 0 


pressors. Results for these conditions were used in deriving the echo 
model. 

The test system, shown on Fig. 11, provided for two-way conversa- 
tion between pairs of subjects. The two ends of the test system were 
located in separate, acoustically treated rooms for which the ambient 
room noise was about 38 dB(A). Each subject was able to hear his 
own voice by means of (2) a side tone path with loudness loss of 12 dB 
and (22) an echo path. 

One hundred pairs of subjects participated in the tests; 50 of these 
evaluated the 600-ms delay case, the other 50 evaluated the 1200-ms 
delay case. The two members of a pair (they were acquainted) were 
located in the two separate test rooms. Prior to a test session, they 
were instructed that they should discuss a subject of mutual interest 
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Table XII— Continued 


Percent of Subjects’ Votes in Each Category 


Echo Path 


Circu it 


Loudness 
Loss Iixcel. Good Fair Poor Unsat. 
(dB) 


Delay 
(ms) 


Noise 
(dBrnC) 


0 
7 
7 
0 
3 
0 


36.0 


28.0 


56.0 


« @  * 8  « «@ 
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TELEPHONE CONNECTION QUALITY 


TRANSMITTER RECEIVER 









A A 
| 
| | 
| | 


IN IN 
SUBJECT SUBJECT 
A's B's 

HANDSET 


HANDSET 


RECEIVER TRANSMITTER 
Fig. 11—Test system for the 1970 laboratory echo test. 


over the system and told that each test condition would require about 
4 minutes of conversation, at the end of which they should separately 
rate the condition on the 5-point scale: excellent, good, fair, poor, and 
bad. (These categories were further subdivided into undesignated 
thirds, resulting in a 15-point scale.) 

The test conditions and results are given in Table XIII and are also 
reported in Part III of Annex 5 to Question 6/XII in Ref. 19. As with 
tests discussed earlier, these results show that (2) at a given echo-path 


Table XIII—1970 laboratory echo-test results—Approximately 
100 ratings (subjects) per condition 


Connection loudness loss = 18 dB 
Side tone-path loudness loss =12 dB 
Circuit noise = 33 dBrnC 


Echo Path Percent of Subjects’ Votes in Each Category 
phe sags Loss Excel. Good Fair Poor Unsat. 

— 33.0 9.0 49.0 34.0 8.0 0.0 

600 33.0 1.0 1.0 2.0 17.6 78.4 

43.0 2.0 10.8 19.6 39.2 28.4 

53.0 3.9 36.3 44.1 11.8 3.9 

63.0 6.9 47.1 28.4 16.6 1.0 

73.0 6.0 44.0 42.0 6.0 2.0 

1200 43.0 1.0 5.0 19.0 38.0 37.0 

53.0 6.0 29.0 40.0 19.0 6.0 

63.0 4.0 45.0 36.0 14.0 1.0 

73.0 7.0 39.0 39.0 13.0 2.0 
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delay, transmission quality improves with increasing echo-path loud- 
ness loss, and (22) at a given echo-path loudness loss, transmission 
quality is degraded with increasing echo-path delay. 


4.4 1970 SIBYL echo test 


An echo test was conducted in early 1970 using s1ByL. The purpose 
of the test was to enable comparison of results obtained from subjects 
conversing on actual telephone calls to results obtained under labora- 
tory conditions. The results of tests reported in Section 4.2 were used 
to guide selection of the conditions for the 1970 s1ByL echo test. 

Forty-five subjects participated in these tests. Procedures followed 
by the subjects were the same as those for the loss-noise tests reported 
in Section 3.3. 

Test variables were echo-path delay (three values) and echo-path 
loudness loss (five values for each delay). In addition, a condition 
without echo was included as a reference. 

Connection loudness loss was 10 dB, the base condition for the 
Holmdel srsyu tests reported in Sections 3.2 and 3.3. Circuit noise 
was 30 dBrnC. Side tone path loudness loss was about 12 dB and 
average room noise was estimated to be about 42 dB(A). 

The test conditions and results are given in Table XIV. As with 
results of echo tests discussed in preceding sections, these results show 
that transmission quality is strongly dependent on echo-path delay 
and echo-path loudness loss. 


V. ANALYSIS OF INDIVIDUAL TEST DATA 


The raw test results from any individual test provide subjective- 
opinion information expressible in the form of percent of ratings in 
each of the five rating categories for each test condition. The utilization 
of these raw test results in this form for transmission planning is 
difficult because it is usually necessary to have ratings available for 
transmission parameter values not specifically included in the tests. 
Thus, some form of data analysis 1s frequently applied to obtain 
graphical or analytical representations of the data that are more 
convenient for use in transmission planning studies. This can involve 
simple curve fitting to the raw test data®® or more elaborate models of 
subject ratings using binomial or other distributions.” 

For example, Lewinski in Ref. 5 provides separate ‘“‘smooth’’ fits 
for the percentage of responses in the categories excellent, good or 
better, fair or better, and poor or better as a function of circuit noise. 
Similarly Sen in Ref. 6 provides mathematical expressions and contours 
for the percent good or better and percent poor or worse as a function 
of connection loss and noise. A different approach is suggested by 
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Table XIV—1970 SIBYL echo test 


Connection loudness loss = 10 dB 
Circuit Noise = 380 dBrnC 


Echo Path Percent of Subjects’ Votes in Each Category 
Number 
Loudness o} 
Delay Loss Ratings | fxcel. Good Fair Poor | Unsat. 
(ms) (dB) 
— 00 183 


|S | | | 


32.2 36.6 24.6 5.5 1.1 
10.0 5.8 51 0 15.7 29.4 47.1 7.8 
10.8 23 0 8.7 30.4 47.8 13.1 
15.8 38 15.8 21.0 39.5 21.0 2.7 
20.8 45 — 8.9 53.3 20.0 13.3 4.5 
25.8 41 19.5 51.2 29.3 0 0 
36.0 14.4 38 0 2.6 18.4 50.0 29.0 
19.4 o¢ 2.7 0 29.7 37.9 29.7 
24.4 ol 11.8 30.3 23.5 23.5 5.9 
29.4 ol 6.4 35.6 25.8 25.8 6.4 
34.4 46 19.6 06.9 28.3 13.0 2.2 
72.0 22.4 35 0 2.9 2.9 65.7 28.5 
27.4 20 10.0 0 15.0 60.0 15.0 
32.4 34 5.9 8.8 29.4 38.2 17.7 
O74 33 12.1 24.2 30.4 24.2 9.1 
42.4 46 13.1 47.8 30.4 6.5 2.2 


Prosser, Allnatt, and Lewis in Ref. 20 where they point out that five 
separate mathematical functions can be specified, one for each grade 
on the rating scale, but advocate the desirability of a more convenient 
and compact representation by means of a single mathematical model 
that embraces all five functions. They examined various models based 
on the binomial distributions as well as logistic and gaussian curves. 
They adopted the second-order binomial as the simplest adequate 
model to describe the opinion distribution found in their experiment. 

We also recognized the advantages of a single mathematical model 
to represent the distribution of opinion in the five rating categories. 
The normal density curve was selected as a basis for the model 
described in the following sections because it provided somewhat 
greater flexibility in accommodating a variety of standard deviations. 
Because of the availability of digital computers for the data analysis, 
the additional computational complexity associated with the normal 
distribution was not judged to be a problem. 


5.1 Analysis method 


The subjective-test results for each test condition of connection 
loudness loss, Z,, and circuit noise, NV, form a vote histogram containing 
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the proportion, P;, of ratings for each of the rating categories, 7 = 1, 
2, 3, 4, 5. Rating category 1 represents the unsatisfactory category, 2 
represents poor, 3 represents fair, 4 represents good, and 5 represents 
excellent. 

The P,’s for each test condition sum to unity. 

The values of P; for each test condition were used to calculate the 
mean opinion score (Mos) and sample standard deviation (SIGMOS) as 
in eqs. (4) and (5), respectively. 


mos = >: iP; (4) 
i=1 
5 2 
SIGMOS = >» ?P;- (10s) (5) 
i=1 


The vote histogram was represented by a normal density curve with 
mean, uw, and standard deviation, o. The area under this curve was 
divided into five regions, each with area P;. The areas were defined as 
follows: from minus infinity to 1.5 as P,, from 1.5 to 2.5 as P., from 
2.5 to 3.5 as P;, from 3.5 to 4.5 as P:, and from 4.5 to infinity as P;; 
the P;’s sum to unity. This quantization of the area under the normal 
curve into five discrete regions was the basis for using the normal curve 
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Fig. 12—MH loss-noise test data percentages as a function of the Step 3 normal- 
density means compared with percentages predicted using the MH standard devia- 
tion, 1.44. 
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Fig. 13—-HO1 loss-noise test data percentages as a function of the Step 3 normal- 
density means compared with percentages predicted using the HO1 standard devia- 
tion, 1.02. 


to represent this type of data. These P;’s were used to compute the 
pseudo mean opinion scores (MOSg) and sample standard deviations 
(SIGMOSg) in terms of the mean, yp, and standard deviation, co, of the 
normal curve as given in eqs. (6) and (7), respectively. 





Sio=be Ses, ( 5) a| (6) 
MOSg = LH o> = _ 9 ; 
See Lhe me 2 
5 = 1 
SIGMOSg = > Pi - (1080) 
i=l 
4 27 +1 (j+0.5—p) /o ( 5) 
= 195 — i exp( — 5 } dt 
- 2 2 


1 
2 


= (at0se)*} - (7) 


Step 1 in the four-step analysis was to find a normal density curve 
for each test condition such that Mos = MOSg and SIGMOS = SIGMOSg. 
These two constraints were used in an iterative computer procedure 
to determine the values of the mean, yp, and standard deviation, oc, of 
the normal density curve used to represent the data for each test 
condition. 
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Three other criteria were considered in the determination of a 
normal density curve during the development of this analysis pro- 
cedure. One criterion was to determine yw and o such that the propor- 
tions good or better (Ps + Ps) and poor or worse (P; + P2) were the 
same for the data and the normal curve. Another was to determine pz 
and o to minimize the sum of the squares of the differences between 
P, and P,, P; + P2 and P, + Po, P; + Po + P3 and P, + P. + Ps, 
and finally P; + P2 + P; + Ps and P, + P. + P; + Ps. The third 
criterion was to determine » and o to minimize the sum of the square 
of the differences P; — P;,7 = 1, 2, ---, 5. The selection of Mos = MOSg 
and SIGMOS = SIGMOSg was chosen as the general criterion of fit, since 
the other criteria were found to be more sensitive to the experimental 
variability inherent in subjective data of this type, particularly when 
the number of subjects used is small (less than about 100). This 
selection was made after applying the several analysis criteria to several 
hundred sets of test data generated by Monte Carlo simulation. 

After all the test conditions were represented by a normal density 
curve with mean, yp, and standard deviation, c, an average of the stan- 
dard deviations for all of the test conditions was computed in Step 2. 
In determining this average, the individual o’s were weighted in 
accordance with the number of votes per condition and by the weight- 
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Fig. 14—HO2 loss-noise test percentage as a function of the Step 3 normal-density 
means compared with percentages predicted using HO2 standard deviation, 0.998. 
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Fig. 15—1968 laboratory echo data percentages (noise=18 dBrnC) as a function 
of the Step 3 normal-density means compared with percentages predicted using the 
echo standard deviation, 0.888. 


ing given in eq. (8). This latter weighting was based on the analyses 
of the Monte Carlo data mentioned previously by examining the 
variations in the standard deviations as a function of the fit mean, uy. 

1 2 
4, oe 8” 
1 + 3 


Weighting = (8) 


The weighted average standard deviation, o., was then used in 
Step 3 as the standard deviation for all of the normal density curves. 
Using oq, a new mean, “a, was computed for each test condition subject 
to the constraint, MoS = MOSg. The end result was a family of normal 
density curves, all with the same standard deviation and with means 
as determined above. 

Figures 12 to 19 illustrate the results from the first three steps in 
the procedure for several of the tests described in this paper. In these 
figures the cumulative percent of ratings in four categories—unsatis- 
factory, unsatisfactory plus poor, unsatisfactory plus poor plus fair, 
unsatisfactory plus poor plus fair plus good—are plotted against the 
fit mean, a, determined in Step 3. The solid curves are plotted using 
the weighted average standard deviation, oa, from Step 2. Also shown 
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are the raw data plotted against the respective fit mean, ua, for each 
condition. These figures show that the normal density curves defined 
by the values, waz, and an average standard deviation, o., provide 
a convenient and simple representation of the raw data for any 
single test. 

In Step 4, the means of all the normal curves are fitted by a suitable 
analytical function of the test parameters using a least-squares-fit 
technique. 

In summary, the steps involved in the analysis procedure can be 
described as follows: 


Step 1. A normal density curve is used to represent the vote histo- 
gram for each test condition such that Mos = MOSg and 
SIGMOS = SIGMOSg. 

Step 2. The standard deviations of the normal curves of Step 1 for 
all test conditions under consideration are weighted and 
averaged to obtain a single value for the standard devi- 
ation, oa. 

Step 3. The single value of standard deviation from Step 2 is used 
for each test condition as the standard deviation of the 
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Fig. 16—1968 laboratory echo data percentages (noise=28 dBrnC) as a function 
of the Step 3 normal-density means compared with percentages predicted using the 
echo standard deviation, 0.888. 
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Fig. 17—1968 laboratory echo data percentages (noise=38 dBrnC) as a function 
of the Step 3 normal-density means compared with percentages predicted using the 
echo standard deviation, 0.888. 
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Fig. 18—s1Byu echo data percentages as a function of the Step 3 normal-density 
means compared with percentages predicted using the echo standard deviation, 0.888. 
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Fig. 19—1970 laboratory echo data percentages as a function of the Step 3 normal- 
density means compared with percentages predicted using the echo standard devia- 
tion, 0.888 

corresponding normal curve, and the mean of the normal 
CUrVe, a, 1S adjusted such that MOS = MOSg. 

Step 4. The means of all normal curves of Step 3 are fitted, on a 
least-squares-error basis, to an appropriate function of the 
test parameters. 


The results at each step of this analysis are summarized in Table 
XV for HO2 loss/noise data. 

In this table, the results presented for Step 4 are based on the 
analytical function [eq. (11) in the next section] fitted in Step 4. 
Comparison of the entries for Step 4 with the entries for the previous 
steps illustrate the extent to which this analytical function provides 
a good fit to the test results. For this test, all of the fit means in Step 4 
are within approximately 0.2 of the fit means in Step 3. The individual 
differences have a mean of 0.035 and a standard deviation of 0.13. 
This agreement is considered reasonable in view of the average stan- 
dard deviation of 0.998 and about 50 to 70 ratings per condition. Some- 
what larger differences were obtained in the MH and HO! tests where 
the number of ratings per test condition was smaller. 


5.2 Loss-noise analysis 
The results obtained by applying the analysis method to the data 
from the three loss/noise tests are given in eqs. (9), (10), and (11), 
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Table 


XV—Summary of results for HO2 loss-noise test 
at each step in the analysis procedure 


Step in the 
Fit Process 

Excel. 
Raw Data 49.49 
Step l 51.54 
Step 2 51.71 
Step 3 49.16 
Step 4 50.00 
Raw Data 47.23 
Step 1 48.79 
Step 2 48.88 
Step 3 50.40 
Step 4 50.00 
Raw Data 14.00 
Step 1 11.81 
Step 2 12.11 
Step 3 12.11 
Step 4 12.26 
Raw Data 3.23 
Step 1 1.26 
Step 2 0.97 
Step 3 0.99 
Step 4 1.03 
Raw Data 27.14 
Step 1 25.24 
Step 2 26.89 
Step 3 27.45 
Step 4 25.42 
Raw Data 18.97 
Step 1 19.46 
Step 2 21.14 
Step 3 21.46 
Step 4 27.72 
Raw Data 4.69 
Step 1 1.40 
Step 2 3.44 
Step 3 3.37 
Step 4 5.44 
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Percentage of Votes 


Good 





Fair 





Poor 


Unsat. 





Mean 
Opinion 
Score* 


Standard 
Deviation* 


Test Condition 1, Loss = 5dB, Noise = 25 dBrnC 


35.35 
30.94 
33.48 
34,51 
34.18 


13.13 
14.15 
12.77 
13.96 
13.56 


2 


bone 


Test Condition 2, Loss 


40.62 
36.59 
34.61 
34.02 
34.18 


10.20 
12.98 
14.10 
13.38 
13.56 


Test Condition 3, Loss 


26.00 
31.46 
31.24 
31.24 
31.38 


40.00 
36.85 
36.45 
36.45 
36.37 


18.00 
16.75 
16.88 
16.88 
16.73 


Test Condition 4, Loss 


Test condition 5, Loss 


35.71 
40.95 
38.13 
38.19 
37.91 


Test condition 6, Loss 


39.66 
39.06 
36.80 
36.91 
38.22 


32.26 
27.39 
27.83 
28.04 
28.34 


31.43 
27.16 
26.72 
26.35 
27.70 


32.76 
31.65 
30.60 
30.38 
26.17 


35.48 
36.40 
37.88 
37.82 
37.73 


2 
.20 
02 


6.90 
8.93 
10.09 
9.91 
7.10 


.02 
32 


4.303 
4.303 
4,348 
4.303 
4.318 


0.846 
0.846 
0.782 
0.801 
0.795 


= 10 dB, Noise = 25 dBrnC 


4.325 
4.325 
4,298 
4.325 
4.318 


0.762 
0.762 
0.803 
0.792 
0.795 


= 20 dB, Noise = 25 dBrnC 


wWwwre © 
Oo et bet HS 


3.320 
3.320 
3.320 
3.320 
3.326 


0.989 
0.989 
0.998 
0.998 
0.998 


= 30 dB, Noise = 25 dBrnC 


25.81 
26.16 
25.23 
24.91 
24.47 


0.00 
0.48 
8 

9 


S09 
CO 
moO > 


1.72 
0.89 
1.37 
1.34 
0.79 


2.226 
2.226 
2.217 
2.226 
2.238 


3.672 
3.672 
3.662 
3.672 
3.850 


0.974 
0.974 
0.946 
0.947 
0.949 


= 5 dB, Noise = 32 dBrnC 


0.889 
0.889 
0.938 
0.935 
0.945 


10 dB, Noise = 32 dBrnC 


0.917 
0.917 
0.965 
0.963 
0.933 


Test condition 7, Loss = 20 dB, Noise = 32 dBrnC 


6.25 
14.76 
17.24 
17.03 
21.94 


48,44 
42.65 
36.64 
36.53 
38.19 


34.38 
33.60 
30.91 
31.11 
26.39 


6.25 
7.59 
11.77 
11.97 
8.03 


2.688 
2.688 
2.697 
2.687 
2.904 


0.864 
0.864 
0.999 
0.999 
1.007 


Fit 
Mean 


4.543 
4.543 
4.479 
4.500 


4,472 
4,472 
4.510 
4.500 


3.333 
3.333 
3.333 
3.340 


2.166 
2.166 
2.176 
2.190 


3.885 
3.885 
3.902 
3.840 


3.700 
3.700 
3.711 
3.910 


2.684 
2.684 
2.674 
2.900 
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Fit 
Sigma 


1.117 
0.998 
0.998 
0.998 


0.923 
0.998 
0.998 
0.998 


0.985 
0.998 
0.998 
0.998 


1.043 
0.998 
0.998 
0.998 


0.922 
0.998 
0.998 
0.998 


0.929 
0.998 
0.998 
0.998 


Table XV— Continued 


Percentage of Votes M 
ean Standard Fit 


Roe Deviation* | Mean 


Step in the 
Fit Process 


Excel. | Good Fair Poor | Unsat. 














Test condition 8, Loss = 30 dB, Noise = 32 dBrnC 


Raw Data 1.92 7.69 15.38 | 51.92 23.08 2.135 0.920 — 
Step 1 0.67 6.66 | 25.95 | 38.87 27.85 2.135 0.920 2.076 
Step 2 0.76 6.92 | 25.87 | 38.26 28.19 2.138 0.931 2.076 
Step 3 0.75 6.87 | 25.78 | 38.27 28,33 2.135 0.930 2.072 
Step 4 0.45 4.89 | 21.72 | 38.15 34.80 1.980 0.895 1.890 
Test condition 9, Loss = 5 dB, Noise = 42 dBrnC 
Raw Data 11.32 | 11.32 | 35.85 | 33.96 Ceo) 2.849 1.088 —_— 
Step 1 6.83 | 20.85 | 34.25 | 26.55 11.51 2.849 1.088 2,839 
Step 2 4.80 | 20.59 | 37.91 27.72 8.98 2.845 1.005 2.839 
Step 3 4.84 | 20.67 | 37.93 | 27.63 8.92 2.849 1.005 2.843 
Step 4 5.23 | 21.50 | 38.11 | 26.83 8.34 2.884 1.006 2.880 
Test condition 10, Loss = 10 dB, Noise = 42 dBrnC 
Raw Data 17.53 | 17.53 | 32.99 | 24.74 7.22 3.1384 1.181 — 
Step 1 14.29 | 24.81 | 30.53 | 20.76 9.60 3.134 1.181 3.150 
Step 2 8.81 27.48 | 37.97 20.83 4.91 3.144 1.005 3.150 
Step 3 8.63 | 27.24 | 38.02 | 21.07 5.03 3.134 1.006 3.139 
Step 4 7.45 | 25.51 | 38.30 | 22.83 5.90 3.058 1.007 3.060 
Test condition 11, Loss = 20 dB, Noise = 42 dBrnC 
Raw Data 4.35 10.87 19.57 | 41.30 23.91 2.304 1.081 mea 
Step 1 2.92 11.45 26.35 31.67 27.61 2.304 1.081 2.217 
Step 2 1.11 8.82 | 28.91 | 37.54 23.62 2.262 0.953 2.217 
Step 3 1.25 9.51 29.86 | 37.16 22.23 2.304 0.960 2.263 
Step 4 1.24 9.46 29.79 | 37.18 22.32 2.301 0.959 2.260 
Test condition 12, Loss = 30 dB, Noise = 42 dBrnC 
Raw Data 0.00 0.00 | 25.00 0.00 75.00 1.500 0.866 = 
Step 1 0.98 3.30 9.20 | 17.76 68.75 1.500 0.866 0.705 
Step 2 0.01 0.25 3.35 17.68 78.71 1.252 0.521 0.705 
Step 3 0.05 1.06 8.84 | 28.93 61.12 1.500 0.705 1.218 
Step 4 0.12 1.98 | 13.00 | 33.70 51.20 1.661 0.786 1.470 


Fit 
Sigma 


1.116 
0.998 
0.998 
0.998 


1.265 
0.998 
0.998 
0.998 


1.206 
0.998 
0.998 
0.998 


1.626 
0.998 
0.998 
0.998 


* Mean opinion score and standard deviation are calculated from the percentage of votes given in 


each table for the corresponding step in the fit process. 


respectively, for the MH, HOI, and HO2 tests. 


uuu = 11.54 — 0.1099|L, — 11.7| — 0.168N1 — 0.001059L.N, 
1.44, 


o 


Huo. = 7 — 0.1365|Z,. — 10.31| — 0.1219N2 + 0.001577L.N 2 
o¢ = 1.02, 


Huo. = 7.17 — 0.1681|L, — 6.7| — 0.1058N;3 + 0.002106L.N; 
« = 0.998, 
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where 


L, = Acoustic-to-acoustic loudness loss (in dB) of an overall tele- 
phone connection, determined using the Electro-Acoustic 
Rating System (£aRs) method. 

N = Circuit noise (in dBrnC) at the input to a set with a receive- 
loudness rating of 26 dB, determined using the Ears method. 

N, = Total noise in dBrnC resulting from power addition of the 

circuit noise, NV, from the MH tests with 34.03 dBrnC. 

Total noise in dBrnC resulting from power addition of the 

circuit noise, NV, from the HO1 tests with 23.76 dBrnC. 

N; = Noise, N from the HO2 tests. 


N» 


The values 34.03 and 23.76 were determined as fit parameters. The 
particular functional form was selected to provide as simple a model 
as possible of the systematic effects observed in the data. 

The results represented by eqs. (9), (10), and (11) revealed two 
important differences between the MH and HO tests. First, the 
standard deviation, o, was considerably larger for the MH tests than 
for either HO1 or HO2. Second, the subjective opinions, as represented 
by the means, yp, calculated from eqs. (9), (10), and (11) were con- 
siderably higher in the MH test compared with the HO tests. These 
differences occurred despite the similarities of the tests. A careful 
examination of either the raw data or the smooth results clearly shows 
that the subjects’ ratings tended to be more critical in the two HO 
tests compared with the subjects in the MH test. 

A clearer picture of the differences is obtained by selecting a set of 
loss (Z.) and circuit noise (V) values over a common range of the 
tests for MH, HO1, and HO2 and computing the corresponding values 
of the means (u) for the three tests from eqs. (9), (10), and (11). If 
plots are made of the MH means versus both the HOI] and HO2 
means and the appropriate linear regression made for both plots, then 
eqs. (12) and (138), respectively, represent the regression line between 
the MH and HO1 means and the MH and HO2 means. Such a plot 
is shown in Fig. 20 for MH and HO1. 


umn = 1.372uH01 — 0.206. (12) 
MMH = 1.288un02 =o 0.215. (13) 


The Pearson product moment coefficient of correlation was found to 
be 0.9586 and 0.9693 for MH with HO1 and HO2, respectively. Eqs. 
(12) and (13) clearly show the difference between the means for the 
MH and the two HO tests. These equations also show the close 
agreement between the two Holmdel tests. 
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MURRAY HILL FIT MEAN 


~~ yy = -0.206 + 1.372 wo, 





0 1 2 3 4 5 
HOLMDEL FIT MEAN 


Fig. 20—Comparison of 1965 Murray Hill and 1969 Holmdel 1 loss-noise results. 


Equations (12) and (13) are used to adjust for the difference be- 
tween the HO results and the MH results in Section VI of this paper. 


5.3 Echo analysis 


The four echo subjective tests yielded data on the subjective effects 
of echo-path loudness loss, #, in dB and echo-path delay, D, in ms. 
Circuit noise, NV, was not the same for all four tests, and needed to be 
considered as a test variable. Loudness loss, L., was a factor in only 
two of these tests (the other two were listening only tests) and, as a 
first approximation, it was decided to ignore L, and concentrate on £, 
D, and N in the analysis. 

Preliminary analyses of the individual test data indicated that there 
were only relatively small differences in the absolute ratings among the 
1968-1970 tests. Thus, it was feasible to combine these data and use 
the analysis method described previously. The resulting equation 
relating the normal density means to the test variables, as realized 
at Step 4 1n the analysis, was a function of #, D, and N, where N for 
this preliminary analysis was the actual noise at the telephone set 
terminals. The effect of noise was asymptotic. That is, the fit mean, y, 
was determined largely by the values of # and D when echo was the 
predominant impairment, while, for any value of D, increasing the 
value of & gradually led to the mean being solely determined by the 
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value of NV. This asymptotic effect of the noise made it relatively simple 
to separate the effects of noise and echo in the function depicting the 
mean as a function of the test variables. The resulting functions are 
given in eq. (14). 


un = 4.74 — 3.32 logio [(1 + D)/V1 + (D/480)?] + 0.1414E 


uv = 6.88 — 0.094N 
ae 2 
uve = be TAN — (“4 ) + (0.627) (14) 
o = 0.888. 


Although these functions were a useful interim result, subsequent 
analysis described in the later sections of this paper indicated that 
when the effects of both loss and noise were included and the noise was 
referred to the input of a reference set, further modifications were 
necessary. These functions are included here because they provided a 
basis for the subsequent modifications. 


VI. COMBINATION OF MODELS 


The results of the three loss-noise tests as given by eqs. (9), (10), 
and (11) showed fundamental differences among the tests. Despite 
the similarity of the tests and the general character of the results, 
both the raw data and the smoothed results showed that the subjects’ 
ratings in the two HO tests tended to be more critical in their evalu- 
ations than they were in the MH tests. This could have occurred 
because of one or more differences in the tests, such as room noise, . 
sidetone path loss, year of test, or some fundamental difference in the 
attitude of the subject groups. The frequent repetition of high-quality 
conditions in the Holmdel tests may also have been a factor. 

The exact reasons for the differences in the test results could not 
be determined. Because of these differences, direct pooling of results 
from the three tests did not appear to be justifiable. However, the 
test results were combined by adjusting the HO! and HO2 results to 
a MH base using the linear transformation obtained from the linear 
regressions introduced previously in eqs. (12) and (18). In this way, 
the systematic differences among the test results were preserved, while 
achieving the advantages of a larger data base. The transformed means 
were then included with the MH means, and a new equation was 
obtained by applying Step 4 to this combined set of means. Table XVI 
shows the fit means at Step 3 for the MH, HO1, and HO2 test results 
and the adjusted fit means for the HO1 and the HO2 test results. Thus, 
the final fit was based on a total of 51 conditions. The final result 
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Table XVi—-Data for combined fit—Step 3 


Connection 
Loudness 


Loss 


(dB) 


10 
20 
25 
30 
10 
20 
25 
10 
20 


30 


Holmdel 1 
Circuit Fit 
Noise Mean 
(dBrnC) (ua) 
22 4.18 
22 3.94 
22 3.35 
22 2.36 
22 2.46 
35 3.17 
35 2.54. 
35 1.94 
35 1.86 
40 2.97 
40 1.73 
40 1.72 
45 2.06 
45 1.76 
45 0.93 


Adjusted 


Fit 


4.14 
3.28 
2.46 
2.35 
0.87 
2.17 
2.15 
2.62 
2.21 
1.07 


Connection 
Loudness 


Loss 
(dB) 


10 
20 
30 


Holmdel 2 
Circuit Fit 
Noise Mean 
(dBrnC) (ua) 
25 4.48 
25 4.51 
25 3.33 
25 2.18 
32 3.90 
32 3.71 
32 2.67 
32 2.07 
42 2.84 
42 3.14 
42 2.26 
42 1.22 


0.44 
3.83 


2.70 
1.36 


is eq. (15). 


uma = 10.36 — 0.185V(L, — 7.2)? + 1 — 0.1647N p 
+ 0.00167LzNp, (15) 


where 


Nr = Power addition of noise with 27.37 dBrnC. 


6.1 General rating scale 


Equation (15) above is calculated in terms of the MH base. This 
equation together with eqs. (12) and (13) can be used to express the 
representation of the subjective ratings in terms of any one of the 
other test bases, HO1 or HO2. Each test base has a standard deviation 
associated with it that can then be used in conjunction with the 
computed means to calculate predicted vote histograms from the 
normal density curves. However, if this is done, the fit means and vote 
histograms will be different for each test in accordance with the 
difference in absolute ratings obtained for each test. To eliminate the 
need for three separate equations, one for each test, a general trans- 
mission-rating scale was established. 

This transmission rating scale, referred to as the R-scale, is simply 
a linear transformation of the normal density means, defined by eq. 
(15), with the constraints that two preselected transmission conditions 
are to be the anchor conditions for the transformation. R-scale values 
of 80 and 40, respectively, were selected for the transmission conditions 
L. = 15, N = 25, and L, = 30, N = 40. These two transmission con- 
ditions were selected to be well separated in quality. The first pair is 
typical of a short intertoll connection and the latter represents an 
extreme condition of loss and noise that should rarely occur even on 
long intertoll connections between long loops. 

Using the above transmission conditions as anchors, f-scale values 
can be specified in terms of » for each test through the linear trans- 
formation & = a+ by, with a and 6b determined from the anchor 
constraints. 


6.2 Loss-noise model 


From eq. (15), the transmission condition Lz = 15, N = 25 yields 
umu = 4.806 and L, = 30, N = 40 yields umn = 1.528. Using these 
values of u, respectively, with R-scale values of 80 and 40 determines 
the transformation to the R-scale from the yu scale as given in eq. (16). 


R= 21.37 + 12.20umu. (16) 
Substituting eq. (15) for umn into eq. (16) gives eq. (17), which is 
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THIRD—STEP FIT MEAN, {ta 
w 


DATA 

21 dBrnC NOISE 

28 dBrnC NOISE 
1 36 dBrnC NOISE 

44 dBrnC NOISE 





0 5 10 15 20 25 30 
CONNECTION LOUDNESS LOSS IN dB 


Fig. 21—MH loss-noise test, Step 3 means from data as a function of loudness loss 
and noise compared with means predicted from the loss-noise model at the MH base. 


the R-scale representation of the subjective opinion for loss and noise. 


Riy = 147.76 — 2.257V(L, — 7.2)? + 1 — 2.009N r + 0.02037L,N r. 
(17) 


Equation (17) is plotted in Fig. 1 as transmission rating versus L, for 
a selected set of values of NV. These curves represent the predicted 
transmission rating, in terms of the R-scale, for selected values of 
DL, and N. : 

The #-scale result of eq. (17) can also be used in conjunction with 
the appropriate standard deviations associated with eqs. (9), (10), 
and (11) and the appropriate inverse linear regression lines of eqs. 
(12) and (18) to calculate percent good or better, poor or worse, or 
other characteristics at the chosen test base. 

For the u-scale, the proportion of ratings good or better is the 
integral of the standard normal density curve from (3.5 — p)/o to 
infinity. In the R-scale, this corresponds to the integral of the standard 
normal density curve from minus infinity to [R — (a + 3.5b) ]/ob. 
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Similar computations can be made for proportion poor or worse, or 
for proportions of ratings in any of the five categories. The appropriate 
limits of integration to compute the proportion of ratings good or 
better and poor or worse are given in Table II for the three loss-noise 
test bases (MH, HO1, HO2). | 

The above discussion concerning the relationships between the 
proportions good or better and poor or worse and the R-scale lead to 
the plots of Fig. 4 which show these relationships for the three 
test bases. 

Finally, the results summarized in Table II were also used to 
generate curves showing the tradeoff between LD, and N for selected 
values of percent good or better and percent poor or worse as shown 
in Figs. 6 and 7, respectively. As noted on the figures, these results 
correspond to the MH data base. | 

In Figs. 21 to 28, the third-step fit means for the individual tests 
are plotted as a function of loudness loss with circuit noise as a param- 
eter. The solid lines in the figure correspond to the values of trans- 
mission rating in the final model transformed by the appropriate 
relation between » and F for each test. These figures show a generally 
good fit to the individual test results for each of the tests. 


THIRD-STEP—FIT MEAN, 






DATA 

Vi 22dBrnC NOISE 
1 O 35dBrnC NOISE 
O 40dBrnC NOISE o 
M4 45dBrnC NOISE 


0 
0 5 10 18 20 25 30 


CONNECTION LOUDNESS LOSS IN dB 


Fig. 22—HO1 loss-noise test, Step 3 means from data as a function of loudness loss 
and noise compared with means predicted from the loss-noise model at the HO1 base. 
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VY 25dBrnC NOISE 
© 32 dBrnC NOISE 
A 42 dBrnC NOISE 


THIRD-STEP FIT MEAN, 





0 5 10 15 20 25 30 
CONNECTION LOUDNESS LOSS IN dB 


Fig. 283—HO2 loss-noise test, Step 3 means from data as a function of loudness loss 
and noise compared with means predicted from the loss-noise model at the HO2 base. 


6.3 Talker-echo model 


The R-scale was introduced previously in terms of LZ, and N. Thus, 
it was necessary to use the conversational echo test results which 
included both loss and noise as parameters to establish an expression 
for the echo results on the AR-scale. The 1970 srpyu test was used as a 
basis for this conversion. As indicated in Table XIV, this test included 
a base condition with ZL, = 10 dB and N = 30 dBrnC for which the 
fit mean, wp, was 4.01. The R-scale value corresponding to this combi- 
nation of loss and noise is 83.47. Similarly, the results for the noise 
condition in the 1968 lab tests were taken into account but given less 
weight because they were obtained in a less realistic test environment. 
A further aid in deriving this conversion was the HO1 sisyu test for 
loss and noise which preceded the 1970 siBpyL echo test and was 
conducted in the same manner with the same subjects. The final 
conversion is given in eq. (18) below. 


R= 18.7 + 16.1yz1. (18) 


This relationship is almost identical to that of the HO1 siByt tests 
at low values of u. However, a slight correction was included for higher 
values of » to take account of the base condition in the 1970 sIBYL 
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echo test. For the HO1 test the value of » corresponding to the base 
condition (R = 83.47) was 3.86. The relationship in eq. (18) gives a 
value of 4.02 and provides good agreement with the actual value of u 
which was 4.01 in the 1970 srpyu echo tests. 

Substituting eq. (14) for uz: into eq. (18) yields eq. (19) which is 
the transmission-rating model for echo. 


Ry = 95.01 — 53.45 logio [(1 + D)/V1 + (D/480)?] + 2.277E. (19) 


Equation (19) is plotted in Fig. 2. 

In the analysis above, eq. (19) was derived from eqs. (14) and (18) 
to provide excellent agreement for the 1970 s1ByL echo tests. However, 
these same relationships did not provide good agreement with the 1970 
lab tests which had a higher loss included in all conditions. The base 
condition for this test as given in Table XII, with LZ, = 18 dB and 
N = 33 dBrnC (R = 67.36), had a fit mean, p, of 3.59. 

The equation, 


R= 20 + 13.33u2z9, (20) 


provided a good match at this point and retained the relationship 
between » and R at low values of » obtained previously for the sIBYL 
echo tests. With the relationship defined by eq. (20), the transmission- 
rating model for echo given in eq. (19) provided an excellent repre- 
sentation of the results from the 1970 lab test for echo. The extent of 
the agreement is illustrated in Section 6.4 where the combined loss- 
noise-echo model is discussed. 


6.4 Loss-noise-echo model 


In the development of the echo results, 1t was noted that the deg- 
radations considered were echo-path-loudness loss (/), echo-path 
delay (D), loudness loss (Z.), and noise (VV). Loss and noise were 
eliminated in the final echo result because it was felt that for any 
combined result, the loss and noise influence should be based on the 
larger data base available from the s1ByL tests. | 

The original analysis of the echo-loss-noise data showed that the 
loss and noise really only affected the circuit performance as an 
asymptote. That is, for very large HL, the ratings were determined by 
LL, and N. Use was made of this fact when the echo result of eq. (14) 
was developed. The combination of Rry and Rg was made such that 
this asymptotic behavior was retained in the final model. 

The final result for loss, noise, and echo is presented in terms of the 
k-scale as Rryg. This final result is shown in eq. (21). 


R _ Rin+ Re (= — Reg 
ne. —— 


: ; ) +, (21) 
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Fig. 24—1968 laboratory echo test (noise=18 dBrnC, loss=10 dB) Step 3 means 
from data as a function of echo-path loss and delay compared with means predicted 
from the loss-noise-echo model at the echo 1 base. 


ECHO-—PATH DELAY, 
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20 
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Fig. 25—1968 laboratory echo test (noise=28 dBrnC, loss=10 dB) Step 3 means 
from data as a function of echo-path loss and delay compared with means predicted 
from the loss-noise-echo model at the echo 1 base. 
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ECHO-—PATH DELAY, D, IN ms=1.5 


DATA 


SYMBOL DELAY (ms) 
Vv 1.5 


O 20 
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Fig. 26—1968 laboratory echo test (noise=38 dBrnC, loss=10 dB) Step 3 means 
from data as a function of echo-path loss and delay compared with means predicted 
from the loss-noise-echo mode] at the echo 1 base. 


ECHO—PATH DELAY, 
D, IN ms=10 


DATA 
SYMBOL DELAY ms 
ray 10 
Oo 36 
Oo 72 


THIRD-STAGE FIT MEAN, fta 





0 10 20 30 40 50 60 70 
ECHO—PATH LOUDNESS LOSS, E, IN dB 


Fig. 27—s1Byt echo test, Step 3 means from data as a function of echo-path loss 
an rane compared with means predicted from the loss-noise-echo model at the 
echo 1 base. 
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Fig. 28—1970 laboratory echo test, Step 3 means from data as a function of echo 
path loss and delay compared with means predicted from the loss-noise-echo model 
at the echo 2 base. 


With C = 0, Rrivz is simply the lesser of Rry and Rg. The factor C 
is selected to represent the additional degradation when Rry and Rz 
are nearly equal. The value of C = 10 was based on echo tests that 
included echo, loss, and noise and was obtained as the product of the 
constant, 0.627, in eq. (14) and the slope of the one relating R and yp 
given in eq. (18). 

The final result in terms of the R-scale is: 


Rive = AE (SSA) + ao. 22) 


For high echo-path-loudness loss, the ratio is determined mainly by 
connection loudness loss and circuit noise and the result reduces to 
the Rry result. Similarly, for connection loudness loss near optimum 
and low circuit noise, the rating is determined mainly by the echo, and 
the result effectively reduces to the Rz result. 

Comparison of the final model for loss, noise, and echo with the 
third-step-fit means for the individual tests are shown in Figs. 24 to 
28. As in the case of the loss-noise model, the final loss-noise-echo model 
provides good agreement with the test results from each of the in- 
dividual tests. 
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APPENDIX A 
Examples Demonstrating Use of the Transmission-Rating Models 
A.1 Example 1—local connection 


500-type telephone sets on a calling and a called customer loop, each 
of which consists of 1 kilofoot of 26-gauge nonloaded cable: 


Loss (L.): TLR = —23.6 dB 


RLR = 26.3 dB 
DL. = (TLR + RLR) dB 
= 2.7dB 


Noise (V): Nr = Total noise at each telephone set from two 
loops, each of which meet the 20 dBrnC 
loop-noise objective.® 

= 23 dBrnC 
N = Total noise referred to a telephone set with 
RLR of 26 dB 
= 27 dBrnC. 


Talker Echo (assumed negligible) : 


Using Table II with L, = 2.7 dB and N = 27 dBrnC, 
Rin = 78.8. 
Using Table II for the MH data base, 
GoB = 79.2%, .PoW = 6.6%. 


A.2 Example 2—local connection 


500-type telephone sets and a calling and a called customer loop, 
each of which consists of 8 kilofeet of 26-gauge nonloaded cable: 


Loss (Z,): TLR = —18.9 dB 


RLR = 27.3 dB 
DT, = 84 dB 
Noise (NV): Assumed to be the same as for Example 1: 
N = 27 dBrnC. 
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Talker echo (assumed negligible) : 
Using Table II with LZ, = 8.4 dB and N = 27 dBrnC, 
Rin = 88.7. | 
Using Table II for the MH data base, 
GoB = 92%, PoW = 1.8%. 


A.3 Example 3—local connections 


500-type telephone sets on a calling and a called customer loop, 
each of which consists of 15 kilofeet of 26-gauge nonloaded cable: 


Loss (L,): TLR = —13.2 dB 


RLR = 30.1 dB 
L, = 16.9 dB 
Noise (V): Assumed to be the same as for Example 1. 
N = 27 dBrnC. 


Talker echo (assumed negligible) : 
Using Table IJ with LZ, = 16.9 dB and N = 27 dBrnC, 
Rin = 75.6. 
Using Table II for the MH data base, 
GoB = 74.2%, PoW = 8.9%. 


A.4 Example 4—toll connection 


500-type telephone sets on a calling and a called customer loop, each 
of which consists of 8 kilofeet of 26-gauge nonloaded cable: 


Loss (L,): TLR =—18.9 dB 
RLR = 27.3 dB 
L = Class 5 office-to-Class 5 office loss 
7.7 dB (mean for the connection-length cate- 
gory, 775 to 2900 miles, Table VI of Ref. 12) 
DL, = TUR + RLR + 7.7 
16.1 dB. 


Noise (NV): Assume no noise from the loops: 


Nr = Total noise from the Class 5 office-to-Class 5 
office connection (33.8 dBrnC for the connec- 
tion length category 775 to 2900 miles, Table 
III of Ref. 12 referred to the telephone set) 

= 28.5 dBrnC 
N = 32.5 dBrnC. 


I 


TELEPHONE CONNECTION QUALITY 1369 


Talker echo (assumed negligible) : 
Using Table II with L, = 16.1 dB and N = 32.5 dBrnC, 
Rin = 71. 
Using Table IT for the MH data base, 
GoB = 65.2%, PoW = 13.9%. 


A.5 Example 5—toll connection 


Same as Example 4, Section A.5, except that it takes into account 
talker echo: 


Echo-path delay (D) 
D = Far-end echo-path loss 
= 37.3 ms (mean for connection length 1450 to 2900 miles, 
Table III of Ref. 11). 


Echo-path loudness loss (E£) 
EE’ = Loss of echo path from near-end Class 5 office to distant 
end and return, 
= 23.38 dB (mean for connection length 1450 to 2900 miles, 
Table II of Ref. 11). 
& = TLR + RLR + EL’ 
= 31.7 dB. 


Using Table II with DL, = 16.1 dB, N = 32.5 dBrnC, D = 37.3 ms, 
and # = 31.7 dB. 


Rin = 71 
Rez = 82.6 
Ring = 65.2. 


Using Table II for the MH data base, 
GoB = 52.6%, PoW = 22.4%. 
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LPC Parameters 


By M. R. SAMBUR and N. S. JAYANT 
(Manuscript received May 14, 1976) 


This paper discusses several manipulations of LPc (linear predictive 
coding) parameters for providing speech encryption. Spectfically, the 
paper considers temporal rearrangement or scrambling of the LPC code 
sequence, as well as the alternative of perturbing individual samples in 
the sequence by means of pseudo-random additive or multiplicative norse. 
The latter approach is believed to have greater encryption potential than 
the temporal scrambling technique, in terms of the time needed to “break 
the secrecy code.”’ The encryption techniques are assessed on the basis of 
perceptual experiments, as well as by means of a quantitative assessment 
of speech-spectrum distortion, as given by an appropriate ‘‘distance”’ 
measure. 


l. INTRODUCTION 


Encryption can be an important requirement in speech communica- 
tion systems. Conventionally, encryption has largely been accomplished 
by signal manipulations in the frequency domain; for example, by 
means of spectrum inversion techniques.! With the increased popularity 
of digital codes for speech transmission, time-domain encryption tech- 
niques have received increased attention. Typically the time-domain 


encryption technique consists of temporal rearrangement of samples 
within a time block. For the scrambling of pcm bits in speech wave- 


form coding, a block-length that is at least a pitch period long is 
usually adequate to provide a nonspeech-like output waveform. 
Similarly, the scrambling of differential pcm and delta-modulation bits 
can also produce a nonspeech-like output waveform provided that the 
time-block is sufficiently long. For example, in a 24-kb/s speech code, 
this constraint implies approximately a block length of 64 samples for 
an adequate scrambling of the coded bits.” 

The temporal scrambling of speech samples within millisecond- 
length blocks generally provides what may be referred to as casual 
encryption. This means that a noncasual ‘eavesdropper’ can break the 
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speech secrecy code by the simple expedient of running through a finite 
number of possible rearrangements of the disarranged speech samples 
that are received. Greater degrees of encryption or secrecy can be 
achieved by employing much longer speech blocks for scrambling or, 
alternatively, by subjecting individual speech samples to pseudo- 
random additive or multiplicative perturbations whose undoing is 
typically more time-consuming than a simple temporal rearrangement 
of clean digits or bits. 

The purpose of this paper is to point out that casual encryption as 
well as more formal secrecy can be achieved by appropriate manipula- 
tions of the linear predictive coding (LPc) parameters.*4 The use of 
an LPC code is by no means a necessary requirement for encryption ; it 
can be achieved in conjunction with any kind of speech digitizers, such 
as the waveform codes’ discussed above. However, when the channel 
capacity of communication systems dictates a low-bit-rate vocoder 
instead of a generally higher-bit-rate waveform code, the Lpc parameter 
manipulations discussed in this paper may provide a naturally ap- 
propriate basis for speech encryption and/or secrecy. It shall also be 
seen that an efficient encryption of the Lpc parameters can be achieved 
more readily than similar techniques used to encrypt waveform codes. 
For example, an adequate block length for scrambling the Lec parame- 
ters can be as short as 6 to 8 samples, while the block length for wave- 
form scrambling is typically 16 to 64 samples. 

In this paper, Section II provides a brief description of Lpc encoding 
of speech, while Section III considers the use of temporal scrambling 
and pseudo-random sample perturbations for casual and formal 
encryption in the Lpc domain. Section IV describes attempts to measure 
the efficacy of the encryption techniques. These measurements in- 
volved informal perceptual experiments (the results are usually un- 
ambiguous and one-dimensional enough not to require formal sub- 
jective testing), as well as a comparison of alternative techniques in 
terms of speech-spectrum distortions that they provide. The spectrum 
distortion was assessed by an appropriate distance measurement. This 
distance approach has the advantage of being quantitative; however, 
as discussed in Section IV, the distance criterion has to be invoked 
with caution because spectral distortion, as such, is not a definitive 
measure of speech encryption. 


Il. LINEAR PREDICTION SPEECH MODEL 


The method of linear prediction has proved quite popular and suc- 
cessful for use in speech-compression systems.*®%7 In this method, 
speech is modeled as the output of an all-pole filter H(z) that is excited 
by a sequence of pulses separated by the pitch period for voiced sounds 
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Fig. 1—Discrete model of speech production as employed in linear prediction. 
(a) Frequency-domain model. (b) Time-domain model. 





(b) 


or pseudo-random noise for unvoiced sounds. These assumptions imply 
that within a frame of speech, the output speech sequence is given by 


p 
Sn = 2 AkSn—k <r Gun, 
=1 


where p is the number of modeled poles, u, is the appropriate input 
excitation, G is the gain of the filter, and the a,’s are the coefficients 
characterizing the filter (linear prediction coefficients). Figure 1 il- 
lustrates the frequency-domain as well as the equivalent time-domain 
model of linear prediction speech production. To account for the non- 
stationary character of the speech waveform, the parameters a; of the 
modeled filter are periodically updated during successive speech 
frames.* Generation of speech in this method requires a knowledge of 
the pitch, the filter parameters, and the gain of the filter (amplitude of 
excitation) in each speech frame. 

The tec coefficients model the combined effects of the vocal tract, 
glottal source, and radiation load in each frame of speech. Manipula- 
tions of the LPc coefficients can seriously perturb the frequency charac- 
ter of the speech signal and, hence, destroy the linguistic information 
present in the signal. In contrast, the measurements of pitch and gain 
represent the prosodic aspects of the speech and some characteristics 


* A frame is a segment of speech thought adequate to assume stationarity of the 
speech process. Typical frame lengths employed range from 10 to 30 ms. 
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of the speaker. Manipulations of pitch and gain parameters will affect 
the prosody of the speech, but not seriously diminish the linguistic 
aspects of the waveform. In Section III, we consider several methods for 
efficiently manipulating the Lrc coefficients so as to encrypt the speech 
signal. 

Since the purpose of this paper is the consideration of encryption 
techniques for low-bit-rate vocoders (2.4 kb/s or less), the manipula- 
tion schemes discussed in Section III were not performed directly on 
the Lrc coefficients, but rather on more desirable alternate representa- 
tions of these coefficients. The stability of the linear-prediction filter, 
H(z), is extremely sensitive to small perturbations in the rc coefficients 
and, thus, it is not possible to achieve low-bit-rate coding by trans- 
mitting the Lpc coefficients.° However, by transmitting either the log 
area coefficients or the parcor coefficients, a 2.4-kb/s vocoder is readily 
achieved.® The log area coefficients are nonlinearly related to the Lec 
coefficients by 

ny = log Lt 
: 1 — k;’ 
where the k,’s are termed the parcor coefficients.’ If we denote aj? as 
the zth linear prediction coefficient for a jth-pole linear-prediction 
model, then 


k; = af’. 
The parcor coefficients have the very important property that if 
[ki] <1, a= 1,--:,D, 


then it is guaranteed that the linear prediction filter is stable.* Thus, 
small perturbations in the parcor coefficients or log-area coefficients 
will not affect the stability of the modeled filter. 


Ill. ENCRYPTION TECHNIQUES 
3.1 Temporal scrambling 


The rearrangement of samples within a block of length L is achieved 
by assigning to each sample a new address A (A = 1, or 2, or 3, -°-, 
or LZ) as determined by the state of a maximal-length shift-register 
arrangement. The theory and design of maximal length sequences is 
well documented.®* Here, we simply provide a constructive recapitula- 
tion for the purpose of this paper. The idea is to start with a shift 
register whose length is D = loge L (assume that the block length is 
a power of 2, and that elements in the register are either 1 or 0). The 
next step is to select a so-called primitive polynomial Pp(x) of degree 
D, and to include stage (D — 8S) in the register (S = 0 to D — 1) in 
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an exclusive or (modulo 2 add) feedback arrangement, if the coefficient 
of x5 in P(x) is nonzero. The resulting network now generates a suc- 
cession of 2? — 1 = LZ — 1 nonzero states in the shift register at suc- 
cessive ‘clock’ times, after which the cycle repeats, starting once again 
with the original initial state of the shift register. The number of 
nonzero states in the cycle is identically equal to the repetition period 
L—1 of the cycle. Consequently, the L — 1 states of the shift- 
register (specifically, their decimal equivalents) can be utilized as 
“pseudo-random”’ addresses for a block of LZ — 1 input samples in a 
one-to-one mapping of addresses. If the input block has Z rather than 
L — 1 samples (because of the frequent requirement that L be a power 
of 2), the address of the Zth sample is usually left unaltered by the 
scrambler. Such simplicity is not, however, inevitable, and appropriate 
manipulations that scramble all Z samples are quite conceivable. 

Figure 2 illustrates the scrambler design for the example of D = 3 
and L =7, as defined by a primitive polynomial P3(X) = X° 
+ X?-+ 1. It is seen how input samples (1, 2, 3, 4, 5, 6, and 7) get 
scrambled into the pseudo-random positions (1, 4, 6, 7, 3, 5, and 2) in 
a reversible one-to-one mapping. 

Figure 3 illustrates an alternative design, as defined by a second 
primitive polynomial of degree 3, P3(X) = X3+ X + 1. In this case, 
the output addresses of the input samples are the positions (1, 4, 2, 5, 
6, 7, and 3). 

It is clear that in each of the arrangements in Figs. 2 and 38, the use 
of a different initializing sequence (other than 001) can lead to a 
totally different mapping of sample addresses. There would be L — 1 
nonzero initializations, corresponding to every given P3(X). Inciden- 
tally, the number of primitive polynomials of degree 3 is 2. 


STATE OF SHIFT DECIMAL 
: NUMBER 
PRIMITIVE POLYNOMIAL: REGISTER EQUIVALENT 


3 2 
Xv + Xo +1 001 


1 
100 4 


CORRESPONDING NETWORK 





INPUT REPETITION OUTPUT 
ADDRESSES PERIOD =7 ADDRESSES 





Fig. 2—Scrambler design with a three-stage shift register. 
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STATE OF SHIFT DECIMAL 


PRIMITIVE POLYNOMIAL: ee REGISTER EQUIVALENT 
eee 1 001 1 
100 4 
010 2 
CORRESPONDING NETWORK 101 5 


110 


6 
111 7 
3 


011 
f 001 f 


INPUT REPETITION OUTPUT 
ADDRESSES PERIOD = 7 ADDRESSES 





Fig. 3—Alternate scrambler design with a three-stage shift register. 


Table I lists for D = 1 to 12 a typical set of primitive polynomials 
and also the number of primitive polynomials for each D. Note, for 
example, that a 12-stage shift register with an exclusive or feedback 
network involving stages 12, 11, 8, and 6 provides one of 144 possible 
bases for ascrambler that would operate on an input block of 2” = 4096 
samples. 

The possibility of alternate scrambler designs (as defined by differ- 
ent initializations and/or different primitive polynomials) is an im- 
portant consideration from the point of view of the average descram- 
bling time needed for an eavesdropping code-breaker. 


3.2 LPC parameter scrambling 


The effectiveness of any scrambling scheme in perturbing the se- 
quence of samples is directly proportional to the lack of similarity or 


Table |—List of primitive polynomials 
Degree D Typical Primitive Polynomial Palas Spee D 
1 X +1 1 
2 AX?74+X+4+1 1 
3 X'+X+1 2 
4 X'+X-+1 2 
5 X§'+ X24 1 6 
6 X'+xX-+1 6 
7 XTH+X +1 18 
8 X§+ X4+ X84 X?4 1] 16 
9 xX®°+ X?+ 4 48 
10 A + X3 + I 60 
11 AU + X24 1 176 
12 AV+XF4+X44+ X41 144 
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dynamic ranges of the samples to be scrambled. The greater the range 
of values assumed by the samples, the more effective the scrambling 
scheme.! For an efficient scrambling of the Lec parameters, let us begin 
by ordering the parameters in the following manner: the first sample 
‘in the first block is 211, where z;, denotes the ith LPc parameter* in 
the nth analysis frame. The second sample is x2; and the third sample 
is 731. The arrangement proceeds in this fashion until the (p + 1)tht 
sample, which is defined as x12. Thus, the ordering of Lpc parameters 
for purposes of scrambling is simply a concatenation of the p Lrc 
parameters in each sequential analysis frame. 

Using this particular arrangement, it can be seen that within a block 
of data there is a wide distribution of values assumed by the various 
samples. This observation follows from the fact that the measured 
Lec parameters for any given analysis frame will usually vary across 
the entire permissible range of values. For example, the p measured 
values of the parcor coefficients in any given frame will typically be 
somewhat uniformly spread across the permissible range of —1 to 1.4 
The particular arrangement of LPc parameters given above will thus 
be effective for scrambling purposes due to the large resulting dynamic 
range. In Section IV, we show that a block length as small as eight 
samples (Z = 8) is sufficient to destroy the linguistic information in the 
_ synthetic signal produced by a 12th order analysis (p = 12). 


3.3 Pseudo-random perturbations 


For a more secure secrecy coding of the speech signal, the LPc pa- 
rameters can be modified by a pseudo-random additive or multiplica- 
tive perturbation. Since the repetitive period of any typical pseudo- 
random number generator is extremely large, the process of undoing 
or breaking the encryption is quite difficult and time-consuming. 

Since one of the goals of the present study was to perceptually assess 
the linguistic information in the synthesized speech generated by the 
encrypted Lec parameters, the pseudo-random number perturbation 
scheme was designed to retain the stability of the modified rc filter. 
Thus, for the manipulation of the parecor coefficients, the pseudo- 
random number technique involved the transmission of the sequence of 
parameters 

Yin = kin X Tin, 
where 


kin = tth parcor coefficient in nth frame 
ith pseudo-random number in nth frame; |r:,| S 1. 


Tin 


“The Lec parameters considered in this paper are either the log-area coefficients 
or parcor coefficients. 
= order of Lrc analysis. 
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Since [rin] S 1, |yin| < 1 and the stability of the irc filter is guaran- 


teed. For the modification of the log-area coefficients, the technique 
is simply to transmit 


Yin = Qin + Tin- 


The stability of the resulting rc filter is guaranteed regardless of the 
range of rin. This result follows from the fact that any real value of 
Yin Will lead to parcor parameters that are less than 1. 

In viewing the pseudo-random number manipulation of the rc 
parameters, it should be noted that the spectral characteristics of the 
uPc filter are more sensitive to changes in the parcor coefficients than 
to changes in the log-area coefficients.!° Thus, manipulation of the 
parcor coefficients is a more direct and efficient technique for perturbing 
the spectral properties of the upc filter. For this reason the pscudo- 
random techniques discussed in this paper were applied only to the 
parcor coefficients. If pseudo-random number manipulation is to be 
applied to the log-area coefficients, the manipulation can be made most 
effective if the probability distribution of the random number generator 
is nonuniform, in order to mimic that of the log-area coefficient.” 

For the experimental examination of the pseudo-random number 
perturbation of the parcor coefficients, the following two probability 
distributions were used for generating r;,: 


(t) rin Was uniformly distributed between —1 and 1, or 
(12) Tin Was, with equal probability, set to —1 or 1. 


The second distribution was studied because the resulting manipulation 
of the parcor coefficients is particularly easy to perform and, as we 
shall soon discuss, is effective in destroying the intelligibility of the 
encrypted speech. However, the ‘“‘breaking”’ of the encryption coding 
using the second distribution is not difficult to achieve by using the 
available knowledge of the statistical range of the parameters. For 
example, it 1s well known that the first parcor coefficient is almost 
always positive.* Thus, a negative value of the first parcor coefficient 
indicates a manipulation of this parameter. If the listener knows that 
a +1 or —1 manipulation of the parameters is being employed, then a 
simple reversal of sign breaks the encryption. 


IV. EXPERIMENTAL STUDY 


In this section, we examine the effectiveness of the various encryption 
techniques in destroying the intelligibility of the output speech signal. 
For this purpose, an informal perceptual evaluation was conducted. 
To evaluate objectively the efficacy of the techniques, an LPc distance 
measure proposed by Itakura" was used to reinforce and supplement 
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the perceptual examination. Before discussing the Lec distance mea- 
sure, we emphasize that this measure may not be a definitive or com- 
plete description of encryption efficiency; but it is a good measure of 
spectral distortion, which in turn turns out to be a useful (Gif not ideal) 
indicator of intelligibility loss. 


4.1 Distance measure 


The Lec distance measure is defined as 


dy, = In (a,Va7/bnVb3), 
where 


a, = Original tre coefficient vector (1, a1, ---, @)) measured in 
the nth frame of the speech signal. 
b, = Pc coefficient vector determined after manipulation of the 
original parameters in the nth frame 
and 
V=([o(l¢-3))] @&j=0,1,---,p), 


where v(z) are the normalized correlation coefficients that are computed 
directly from b,.* 

The measure d, has been very effectively applied in problems of 
speech recognition,!! speaker recognition,” and variable frame-rate 
synthesis.*4 Gray and Markel! have recently demonstrated that the 
measure d, 1s very closely related to the rms spectral distance measure. 
Sambur and Jayant!® have also studied the significance of the measure, 
and a complete discussion of the utility of the measure for assessing 
spectral distortions can be found in their paper. For purposes of this 
paper, the important facts to appreciate about the measure d, are 


(2) The greater the value of d,, the more pronounced the spectral 
distortions of the original sound. 


(iz) A value of d, = 0.9 1s a “perceptually” significant boundary for 
evaluating spectral distortion.” 


4.2 Experiment 


For the experimental study, four sentences spoken by four different 
speakers were analyzed using a 12th order (p = 12) upc autocorrelation 
analysis for each contiguous 20-ms frame. The sentences analyzed were: 


(2) A lathe is a big tool. 
(22) May we all learn a yellow lion roar. 
(222) Few thieves are never sent to the jug. 
(wv) It’s time we rounded up that herd of Asian cattle. 
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The encryption schemes that were formally evaluated both per- 
ceptually and with the distance measure of Section 4.1 were: 


a. Serambling 
(1) Block length = 16 


(2) Block length = 8 
b. Pseudo-random manipulation* 
(1) Uniform distribution of ri, for 7 = 1 and rin = 1 for 7 > 1. 


(2) Uniform distribution of r;, for 7 S 6 and r;, = 1 for z > 6. 


(3) Uniform distribution of 7;, for allz (1 $7 S 12). 

(4) +1 distribution of 7;, for? = 1 and r;, = 1 for? > 1. 
(5) +1 distribution of r;, for 7 S$ 6 and r;, = 1 fori > 6. 
(6) +1 distribution of r;, for all (1 S$ 7 S 12). 


Experiment b was performed to determine the number of parcor co- 
efficients that must be altered to effectively encrypt the signal. Since 
the parcor coefficients are approximately ordered in terms of their 
spectral sensitivity, these experiments were performed by sequentially 
removing from manipulation the less sensitive parameters. 


4.3 Results 


4.3.1 Distance evaluation 


4.3.1.1 Uniform pseudo-random manipulation. Figure 4 illustrates the dis- - 
tance-evaluation of the sentence ‘‘May we all learn a yellow lion roar’ 
for the uniform pseudo-random number manipulation of the parcor 
coefficients. Parts (a), (b), and (c) of Fig. 4 indicate, respectively, the 
results of experiments b(1), b(2), and b(8) of Section 4.2. The straight 
solid line in each part of the figure depicts the perceptually significant 
threshold for assessing spectral distortions (d = 0.9). Any frame with 
a distance larger than the threshold is perceptually different from the 
nonencrypted speech. To show just how dramatically the perturbation 
in the spectral character of the speech can be, Fig. 5 illustrates the 
calculated linear prediction spectrum (dotted line) for the nonen- 
crypted speech frame and the corresponding linear prediction spectrum 
(solid line) for the same frame of encrypted speech. The measured 
LPc distance between the illustrated spectra is d, = 3.0, or approxi- 
mately the average value of distance for uniform pseudo-random 
manipulation of the first coefficient. From this figure, it can be ex- 
pected that the character of the encrypted speech is completely differ- 
ent from that of the original speech. 


*Remember 7;, denotes the pseudo-random number multiplicative factor for the 
7th parcor coefficient in the nth frame. 
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Fig. 4—.pc distance as a function of time across the utterance, “May we all 
learn a yellow lion roar,’’ for uniform pseudo-random perturbation of the parcor 
parameters. (a) Manipulation of k1; average distance = 3.4. (b) Manipulation of 
ky to kg; average distance = 4.4. (c) Manipulation of all k;; average distance = 4.4. 


The results depicted in Fig. 4 are typical of the distance evaluation 
results for the uniform pseudo-random manipulation of the parcor 
coefficients determined for the other sentences examined. It is interest- 
ing to note that the average distance for an encryption scheme that 
manipulates the first six parameters is not significantly lower than the 
average distance obtained for the manipulation of all 12 parameters. 
This result can be anticipated from the fact that the higher-ordered 
parcor coefficients are much less sensitive than the lower-ordered pa- 
rameters, and changes in these higher-ordered parameters do not 
significantly change the spectral character of the sound.’ Thus, a less- 
expensive and equally effective encryption scheme can be obtained 
by manipulating only a few lower-ordered parameters. To determine 
the optimum number of parameters necessary for an efficient, uniform, 
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Fig. 5—Comparison of the distorted Lec spectra and the original Lec spectrum. 
Distance between the spectrum equals 3.0. 
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Fig. 6—Average Lrc distance as a function of number of parcor coefficients manipu- 
lated by pseudo-random number techniques. 


pseudo-random encryption, we sequentially increased the number of 
parcor parameters perturbed by uniform pseudo-random manipulation 
and measured the average Lrc distance. Figure 6 illustrates the average 
LPC distance as a function of the number of parameters manipulated. 
From this figure, 1t can be seen that a scheme that perturbs only the 
first four parcor coefficients is quite efficient. 


4.3.1.2 Pseudo-random manipulation of +1 or —1. Figure 7 shows the de- 
tailed distance-evaluation scores for the +1 or —1 pseudo-random 
perturbation of the sentence ‘‘May we all learn a yellow lion roar.”’ 
Parts (a), (b), and (c) of the figure correspond to experiments b(4), 
b(5), and b(6), respectively. Figure 6 illustrates the average LPc dis- 
tances obtained for encryption schemes that sequentially increase the 
number of parameters subjected to +1 or —1 pseudo-random manipu- 
lations. We note from Figs. 6 and 7 that again the perturbation of 
the higher-ordered parcor coefficients does not significantly add to the 
effectiveness of the encryption scheme. It can also be seen from these 
figures that +1 or —1 pseudo-random manipulation is generally 
superior (except for the manipulation of only kh) to the uniform 
pseudo-random number scheme in distorting the speech signal. How- 
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ever, as noted previously, this form of encryption is easier to break 
than uniform pseudo-random number coding. 


4.3.1.3 Scrambling. Figure 8 shows the frame-by-frame distance scores 
for the scrambling of the parcor coefficients for the sentence ‘‘May we 
all learn a yellow lion roar.” The illustrated results are typical of the 
results obtained for the other analyzed sentences. A comparison of the 
distances results of the pseudo-random schemes (Fig. 6) shows that a 
scrambling encryption with a block length of only eight samples (ZL = 8) 
is at least as effective in distorting the spectral properties of the original 
signal as a pseudo-random manipulation of the first parcor coefficient. 
A scrambling scheme with a block length of 16 (Z = 16) or more 
samples is superior to any of the pseudo-random schemes studied. It is 
interesting to note that the scrambling manipulation saturates in 
effectiveness for block length greater than 16. Since the range of the 
parcor coefficients is confined to —1 S k; < 1, increasing the block 
length beyond 16 does not increase the dynamic range of the sample 
within the block and, thus, the effectiveness of the scrambling is not 
enhanced for L > p (see Section 3.2). 


4.3.2 Perceptual evaluation 
To support the results of the distance study, the various encrypted 


utterances were presented to a group of listeners for an informal 
perceptual evaluation of the manipulation schemes. To avoid any 
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Fig. 7—.rc distance as a function of time across the utterance ‘‘May we all learn 
a yellow lion roar” for the +1 or —1 pseudo-random manipulation of the parcor 
coefficient. (a) Manipulation of k1; average distance = 2.8. (b) Manipulation of ky 
to ks; average distance = 6.2. (c) Manipulation of all k;; average distance = 6.3. 
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Fig. 8—uprc distance as a function of time across the utterance ‘‘May we all learn 
a yellow lion roar’’ for the scrambling of the parcor coefficient. (a) Block length = 8; 
average distance = 3.8. (b) Block length = 16; average distance’= 7.7; (c) Block 
length = 64; average distance = 7.6. 


problems posed by the awkward linguistic content of the analyzed 
sentences, the listeners in this study were all familiar with the sen- 
tences, and were also informed that the utterances to be heard were 
typical sentences used to evaluate vocoder systems. 

The listeners in the experiment were asked to determine the intel- 
ligibility of the utterance and to rank-order the effectiveness of the 
encryption schemes. For all the techniques studied, except for the +1 
or —1 manipulation of only ki, the listeners unanimously agreed that 
the encrypted utterances were clearly nonspeech-like. However, for 
the uniform pseudo-random techniques manipulating only the first 
parcor coefficients, the listeners noted that, even though the complete 
utterances could not be understood, there were certain instances in the 
encrypted utterances that were somewhat speech-like and under- 
standable. These instances probably correspond to points in the 
encrypted speech for which the urc distances fall below the perceptual 
threshold. In characterizing the nonspeech-like quality of the en- 
crypted utterances, the listeners termed the pseudo-random perturbed 
utterances as sounding like ‘‘one continuous buzz;”’ the scrambled 
utterances sounded like ‘‘water running through a pipe.” 

In rank-ordering the encryption schemes, the listeners were quite 
definite in characterizing the +1 or —1 psuedo-random manipulation 
of only the first parcor coefficient as least effective. The scrambling 
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with block length of 16 (d = 7.7) was ranked about equal to the +1 
or —1 pseudo-random manipulation of all 12 parcor coefficients 
(d = 6.3), and also to the same manipulation of only the first six 
coefficients (d = 6.2). The uniform pseudo-random scheme that 
altered all 12 coefficients (d = 4.4) was ranked equal to the scheme that 
perturbed only the first six coefficients (d = 4.4), and both techniques 
were ranked slightly less effective than the scrambling with block 
length of 16 (d = 7.7) and the equivalent +1 or —1 pseudo-random 
schemes. The other techniques were ranked somewhere in the middle. 
The perceptual rank-ordering of. the various manipulation schemes 
corresponded almost exactly to the distance evaluation and, thus, 
reinforced the conclusions in that evaluation. 


V. CONCLUSIONS 


There is great interest in low-bit-rate speech-transmission systems 
and in the “securing’’ of these transmission systems. The purpose of 
this paper is to investigate various methods for encrypting a low-bit- 
rate Lec transmission system. The methods chosen for investigation 
were schemes that either scrambled the string of input parcor coeffi- 
clents or multiplied the coefficients by a pseudo-random number. The 
schemes were evaluated by an informal perceptual experiment and by 
the use of an Lec distance measure. The results of the evaluations 
suggest that all the schemes are somewhat successful in distorting the 
original signal. The most successful scheme was the scrambling tech- 
nique with a block length of 16 samples. The pseudo-random manipula- 
tions were almost as effective. 

In viewing the results of the evaluations, it is important to note that 
the distortion of the speech signal is only one consideration in designing 
an encryption system. Another consideration is the difficulty of 
“breaking” the security code. Of the codes examined, the uniform 
pseudo-random number manipulation is the most difficult to break. 
The scrambling scheme is the next most difficult and the +1 or —1 
pseudo-random scheme is the easiest. Still another consideration is the 
transmitter-end complexity of the encryption scheme. Although this 
complexity is somewhat difficult to assess, it appears that the scram- 
bling scheme is the least complex and the uniform pseudo-random ma- 
nipulation is the most complex. In choosing any of these encryp- 
tion schemes, a user would balance the various merits and liabilities of 
the techniques. 


VI. ACKNOWLEDGMENT 


The manipulation of Lec parameters was suggested by Professor 
B. S. Ramakrishna of the Electrical Communication Engineering 


SPEECH ENCRYPTION 1387 


Department, Indian Institute of Science, Bangalore, during a visit 
by one of the authors (N.S. Jayant). 


REFERENCES 


. D. Kahn, The Code Breakers, New York: Macmillan, 1967, pp. 551-554. 

. N.S. Jayant, “Step-size Transmitting Differential Coders for Mobile Telephony,”’ 
BS8.T.J., 54, No. 9 (November 1975), pp. 1557-1581. 

. B.S. Atal and 8. L. Hanauer, “Speech Analysis and Synthesis by Linear Pre- 
diction of the Speech Wave,” J. Acoust. Soc. Amer., 50 (1971), pp. 637-655. 

. J. D. Markel, A. H. Gray, Jr., and H. Wakita, ‘Linear Prediction of Speech- 
Theory and Practice,’”’? Speech Communication Research Laboratory, Inc., 
Santa Barbara, Calif., Monograph 10, 1978. 

. N.S. Jayant, “Digital Coding of Speech Waveforms—PCM, DPCM, and DM 
Quantizers,’’ Proc. IEEE (May 1974), pp. 611-632. 

. M. R. Sambur, ‘‘An Efficient Linear Prediction Vocoder,”’ B.S.T.J., 54, No. 10 
(December 1975), pp. 1693-1723. 

. F. Itakura et al., “An Audio Response Unit Based on Partial Autocorrelation,’’ 
IEEE Trans. Commun., COM-20, No. 4 (August 1972), pp. 792-797. 

. S. Golomb, Shift Register Sequences, San Francisco: Holden Day, 1967. 

. R. G. Gallager, Information Theory and Reliable Communications, New York: 
John Wiley, 1968. 

10. J. Makhoul and R. Viswanathan, ‘Quantization Properties of Transmission 
Parameters in Linear Predictive Systems,’’ Bolt Beranek and Newman, Inc., 
Report No. 2800, April 1974. 

11. F. Itakura, “Minimum Prediction Residual Principle Applied to Speech Rec- 
ognition,’? IEEE Trans. Acoust., Speech and Signal Proc., ASSP-23, No. 1 
(February 1975), pp. 67-72. 

12 H. Wakita, “On the Use of Linear Prediction Error Energy for Speech and Speaker 
Recognition,’ J.A.S.A., 67, Supplement No. 1 (Spring 1975). (A) 

13. D. T. Magill, ‘Adaptive Speech Compression for Packet Communication Sys- 
pe Telecommunications Conference Record, IEEE Publ. 73, CH0805-2, 
29D 1-5. 

14. J. R. Makhoul, L. Viswanathan, L. Cosel, and W. Russel, ‘“Natural Communi- 
cation with Computers: Speech Compression Research at BBN,’’ BBN Report 
No. 2976, Vol. II, Bolt Beranek and Newman, Inc., Cambridge, Massachu- 

.setts, December 1974. 

15. A. H. Gray and J. D. Markel, ‘“COSH Measure for Speech Processing,’’ J.A.S.A., 
§8, Supplement No. 1 (Fall 1975). (A) 

16. M. R. Sambur and N.S. Jayant, ‘“LPC Analysis/Synthesis From Speech Inputs 

Containing Noise or Additive White Noise,’? IEEE Trans. Acoust., Speech, 

and Signal Proc., ASSP-24, No. 6 (December 1976). 


Pe WO Ne 


on NN BD Co 


1388 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1976 


Copyright © 1976 American Telephone and Telegraph Company 
THE BELL System TECHNICAL JOURNAL 
Vol. 55, No. 9, November 1976 
Printed in U.S.A. 


Multiple Tone Parameter Estimation From 
Discrete-Time Observations 


By D. C. RIFE and R. R. BOORSTYN * 
(Manuscript received May 11, 1976) 


In a previous paper, we discussed estimation of the parameters of a 
single tone from a finite number of noisy discrete-time observations. In 
this paper, we extend the discussion to include several tones. The Cramér- 
Rao bounds are derived and their properties examined. Estimation 
algorithis are discussed and characterized. 


l. INTRODUCTION 


In a previous paper,! we reported on the estimation of the parameters 
of tones from a finite number of noisy, discrete-time observations and 
described the case of a single complex tone. In this report, we discuss 
the situation when the signal consists of several, say k, tones, either 
real or complex. By real stgnal we mean 


k 
s(t) = os b; COS (wt + 6;). 
i=l 
The corresponding complex signal is of the form 
k 
s(t) + j§() = » b; expLj (wit + 43) J, 


where s(t) is the Hilbert transform of s(t). 

A computer observes, through the A—D converters, noisy versions 
of the signal, X (¢), and possibly its Hilbert transform Y(t). That is, 
samples are taken of 


X(t) = sé) + Wi), (1) 
and 


Y(t) = st) + WO, (2) 


where W(t) and W(t) are the noise and its Hilbert transform, 
respectively. 


* Polytechnic Institute of New York. 
1389 


The observations are made at times denoted t,. The computer will 
process one or both sample vectors: 


K = [Xo Xy--+)Xw? and Y= [Yo Ya -++, Yeal’, 
where 7’ denotes matrix transpose, 


Xn = X (tn), (3) 
and 


We assume the noise samples, W, and Wn, are independent, zero- 
mean, gaussian random variables with variance o’. 

Let a be the p-element vector of unknown signal parameters. We 
assume all signal parameters are unknown, so that p = 3k, and use the 
convention: 


AZi-2 = Wy (5) 
Q3i-1 >= b;, 
and 
a3; = 0: ~=1tok. 


This model describes several situations. The real signal may be 
received from a data set during a test or it could be a probe signal used 
to characterize a data-transmission channel. The real and imaginary 
parts of the complex signal could occur:as the result of in-phase and 
quadrature modulation processes, as described by Palmer.? The imagi- 
nary part of the complex signal could be the output of a 90-degree 
phase-shift network (Hilbert transformer) through which the real 
signal is passed before the sampling is done. This is done in certain 
types of data sets that use all-digital means to demodulate received 
signals. Samples of the complex signal are easier to process because of 
the absence of negative frequency components, as we show below. The 
model also applies to certain mathematically equivalent, phased-array 
radar problems, such as the one described in Refs. 3, 4, 5, and 6. 

There are two main aspects to the problem of estimating the pa- 
rameters of the signal: lower bounds to estimation accuracy and 
algorithms for doing the estimation. In the next section, the properties 
of the Cramér-Rao (c-R) bounds are explored. There are many other 
bounds that could be applied but we have only examined the c-r 
bounds. Section III describes and evaluates some approximations to 
maximum-likelihood (ML) estimation. In Ref. 1, we found that when 
the signal consists of a single complex tone (&k = 1), then Mt estimates 
can be obtained with any desired accuracy. When several tones are 
present, ML estimation is sufficiently complicated that suboptimum 
alternatives are attractive. 
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Il. CRAMER-RAO BOUNDS 
2.1 General theory 


Maximum-likelihood estimates of signal parameters are unbiased at 
high signal-to-noise ratio (s/n).4*7 We will develop estimation algo- 
rithms that have very little bias, so we have only studied the c-r 
bounds to unbiased estimation accuracy. Even when an estimator has 
some bias, the unbiased bounds serve as useful goals for estimation 
accuracy. Since the accuracy of ML estimates approaches the unbiased 
c-R bounds at high s/n, the unbiased bounds also show what could be 
done if exact ML estimation algorithms were used. 

We found in Ref. 1 that low s/n is that range of s/n where estima- 
tion anomalies occur. None of the known bounds seem to be very tight 
under these conditions. 

The first property of the c-R bounds that we consider is one for which 
we need the following general notation. 

Let V be a “signal” vector whose typical component is of the form 


K 
ier = >. bigs (ws, 0;, n). (6) 
i=1 


Notice that each g;(-) has an associated level, b;, and is a function only 
of n and the zth set of unknown parameters. Time does not necessarily 
enter into the g:(-) functions. Let X be a noisy observation of V. 
Assume the noise is additive, multivariate normal with zero mean and 
correlation matrix R—. If the noise vector is W, then 


K=V+W, (7) 
and the probability density function of X given V is 


Riz 

fK/V) = Gemexl-HK-WRE-VYL 
where N 1s the dimension of V and the 7’ denotes transpose (see Ref. 8, 
page 207). 

The c-R bounds require certain regularity conditions on V, which 
are satisfied by our model.? The bounds are the diagonal elements of 
the inverse of the Fisher information matrix, J, whose typical element” 
Is: 

2 


0 
Jab ~~ lt | Oa,0Q, log s , (9) 





where H/{-} denotes expected value of {-}. The bounds are: 


Var {& — a} 2 J, (10) 
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where J* is the ath diagonal element of J- and @, is an unbiased 
estimate of aa. 
It is easy to show" that 


T 
ov Rov 


dab a dan 00a (11) 





We now present a few theorems that characterize the c-R bounds. 
We assume J is not singular, for reasons discussed below. 


Theorem 1: The C-R bounds to unbiased estimation of the parameters w; 
and 6; of V are functions of b; but are independent of the other levels, 
bj: 7 #2. The bound to unbiased estimation of a level, b;, 1s independent 
of all the levels. 


Proof: Equation (11) is equivalent to 
OV n OV mn 


Oa, Oa,’ 





5 = »; >, im (12) 
nm 
where Ram 1s an element of R. 
The elements of J that are functions of the parameters of g; and q;, 
using the convention given by (5) and the notation g;(n) = gi (wi, 0:, 7), 
are: | 











00; 
J eseech bs; oe Tees, Sait) agit) (13a) 
n m Wy Wj 
Og: 
Fevaaina = 1: T Ram HE gsm), (18b) 
sone Ba dgi(n) dg; (m) 
J 3i—2,3; = b:b; 2 oe Rnm dw: a0; (13¢) 
0qg;(m 
J 3i-1,33-2 = 0; 2) DE Rumgi(n) Sait) (13d) 
nr m J 
J 3i—1,33-1 = DD Ramgi(n)g;(m). — (18e) 
0q;(n 
J 3i-1,33 = 6; DL Ramgi(n) Sas), (13f) 
nm j 
0g; 0g; 
J 3i,aj-2 = bib; 2) DL Ram Sask) Sait). (13g) 
nr m a j 
Og: 
J3i3ji—1 = 0; 0) dD Ram ast) gi(m). (13h) 
dgi(n) Ag; (m) ; 
a iQ dh 0; 06; “ 
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An examination of (18a) through (131) shows that the submatrix of 
J has the form D;Q:,;D,, where 


b; O O 
D;=/]0 1 O (14) 
0 0 Db 
and the matrix Q;; is not a function of any b;. It follows that J has the 
form 
J = DQD, (15) 
where 
D, O 
p=|9 % (16) 
D; 
and 
Qu at ay Qu 
Q=[: -. |. (17) 
Qx1 Ore 
O is a matrix whose elements are all zeros. From (15), 
Jo = DAQ=D-, (18) 


from which the theorem follows. For example, 
Var {41 — wi} 2 Q™/dj. (19) 


This theorem is not entirely new. It is alluded to in Ref. 6. However, 
this form of the theorem shows that, contrary to Ref. 6 and popular 
opinion, precisely known sampling times (or antenna element spacing 
in the equivalent radar problem) are not necessary for the theorem to 
hold. 

The theorem is true whether or not the noise samples are independent 
and regardless of the sampling times. Of course, if the sampling times 
are not known, then the c-r bounds cannot be accurately calculated, 
but that does not obviate the theorem. 

It should also be clear that the number of unknown parameters is 
unimportant to the theorem. Clearly the theorem holds if, for example, 


gi(wi, 0;, n) = COS (witn + 0), 
and if 
tn = nT. 


Theorem 2: The bounds associated with the parameters of the first k tones, 
when there are k + m tones, are not less than the bounds when there are 
only k tones. 
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Proof: The matrix J is always positive semidefinite. Thus, if it is not 
singular, it is positive definite. 


Suppose J is the Fisher information matrix for k + m tones and is 
partitioned so that J, is the J matrix associated with k of the tones. 
This partitioning is always possible. Then write 


J=|---—- el , (20) 
| 


Since J is positive definite, so are J; and Jm. 
Write the inverse of J in the form 


J SS tee eee eee ; (21) 


where J, and U are both 3k by 3k matrices. Theorem 2 is true if 
U:.= j;", (22) 


which means U — J; '‘ is positive semidefinite and which we now prove. 
Using the fact that 


JC =f, (23) 
one can show that 
U = [hk — KJn'K7 J". (24) 
Observe that KJ;,'’K* is positive semidefinite. That is, 
KJ=K? = 0. (25) 


Since J, and hence U are positive definite, (24) and (25) imply that 
U- <s Jz, which implies (22). 

Another implication of the proof of Theorem 2 is that the bounds 
for p of p +m unknown parameters are not less than the bounds 
when only the first p parameters are unknown. 

This theorem is also not entirely new, although we have not seen it 
stated before. A restricted version of the theorem is mentioned in 
Ref. 12, (page 33), and Problem 2.4.23 in Ref. 10 hints at this kind of 
result. 

The theorem depends upon J being nonsingular. It 1s easy, but 
tedious, to show that if the signal vector is composed of samples of the 
real or complex signal described in the introduction and only two tones 
are involved, then J is singular only if the two tone frequencies are 
equal, modulo 27/7’, where T is the intersample time. (Remember that 
a real tone has a component at +w; and another at —w;.) 
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We have not been able to prove this result for an arbitrary number 
of tones, but all of our calculations of various J matrices support the 
hypothesis that J is singular only if two or more of the tone frequencies 
are equal, modulo 27/T (assuming N is large enough). 

When two of the tone frequencies are equal, the receiver 1s receiving 
one less tone than expected. In this paper, we assume that the correct 
number of tones, k, is known and that all of the frequencies are distinct. 


2.2 Equally spaced samples and independent noise 


We now concentrate on the problem described in the introduction. 
Assume all noise samples are independent with variance o?. That is, 


1 
where I is an identity matrix. 
Define 
K 
bn = > b; cos (witn + 0:), (27) 
t=] 
K e 
Vn = », b; sin (win + 6), (28) 
t=1 
and 
t, = nT; n=0,1,---,;N —1. (29) 


As is mentioned in Ref. 1, the time of the first sample, fo, has an effect 
upon bounds and estimation accuracy. We have ignored that problem 
in this paper and taken ¢éo to be zero. 
The signal vector is 
v= an _ ee N i i (complex signal) (80) 
or 
Ve n=OtoN-1 (real signal). (31) 


Then a typical element of J is 
1%! | Olin On , OVn OVn | 


OQ, OAs OQ, Op 


Jar = (complex signal). (32) 














2 
Oo” n=0 


The », terms are dropped if the signal is real. 
Let M(w, @) be the matrix defined by 


T? >) rcosAn —T > nsin An T > ncos An 
M(w,@) = | T > nsin A, > cos An >, sin An , (33) 
T > ncos An — > sin An > cos An 
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where 


An = nol + 6; n=0OtoN —1. (34) 
Let P;; be the matrix defined by 
Pi; = M(w: — ;, 6: — 9;) (35) 
and let P be the p-by-p matrix defined by 
Py ac Pi, 
P=| : aan ie (36) 
Pri Pu 
Let 
1 0 0 
B=|0 —-1 O}- (37) 
0 0 1 


Define a matrix Q:; by 
Q4 = 3LM (a: — oj, 6: — 0;) — M(wi + o;,6; + 6;)B] (88) 
and a matrix Q by 


Qn <-: Qn 
Qn1 Sens Ore 
Then it can be shown (13) that J is given by: 
J= +, DPD complex tones (40) 
or 
J= +, DOD real tones. (41) 


Theorem 3: When the signal consists of two equal-level complex tones, 
the C-R bounds for the same parameters (e.g., the two frequencies) are 
equal. In other words, the mutual interference ts reciprocal. 


Proof: The J matrix is 


_1 7D, Oj}[ Pi Pi|{/Di 0 

J - og? 0 bl | et: = 0 na (42) 
because Py; = Poo = M(O, 0). Observe that Pf = BP:2B because 
M?7(w, 6) = BM (a, 6)B. Thus, J—' has the form 


[Dr oO}, U WD 0 
= 1% pel we vic pe): me 
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where 
V = BUB. (44) 

Hence, 
[Us| = [Vail (45) 


and U;; = Vi, which proves the theorem. 


Theorem 4: The bounds for two tones, real or complex, are periodic in 
6, and @, with period x. 


Proof: The theorem follows from the easily checked fact that 
M(w, 6+ 7) = —M (a, 6). 


Theorem 5: The bounds for real or complex tones are periodic in each 
frequency with period 2x/T. 


Proof: The theorem follows from the fact that M(w + 22/T, @) 
= M(uw, 6). 


Theorem 6: The bounds associated with complex tones depend upon the 
difference frequencies and phases but not upon the absolute values. 


Proof: The theorem follows from (35), (36), and (40). 


It is, in general, tedious to invert J and obtain formulas for the 
bounds. However, it is a simple matter to have a computer calculate 
the elements of J and its inverse..We have done this to obtain a better 
understanding of the bounds. 

A number of illustrative curves are given in Ref. 13. In the interest 
of brevity, we will present only two of the figures here. 

The main thing we learned from the calculations 1s that there is a 
critical frequency separation, 4r/NT, associated with multitone c-R 
bounds. In Ref. 1, it is shown that when a single complex tone is 
present, the bounds are independent of the frequency of the tone. When 
more than one complex tone is present, the bounds approach the single- 
complex-tone bounds when the minimum frequency separation (modulo _ 
27/T) exceeds the critical frequency. The multitone bounds increase 
rapidly as the minimum frequency separation goes below this critical 
frequency. 

This rule applies to a single real tone if it is considered to be two 
complex tones, one at a frequency, say, of w;, and one at —w,. Thus, 
if the frequency of a single real tone is less than 27/NT, modulo 7/T, 
then its c-R bounds are much larger than the corresponding single- 
complex-tone bounds. 

In all cases, the multitone bounds depend upon the tone phases, as 
might be expected. 
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BOUND ON CENTER TONE 


RMS IN HERTZ 


BOUND FOR. 
TWO TONES 





0 20 40 60 80 100 120 
Af IN HERTZ 


Fig. 1—Frequency estimation bound vs Af for center of three equally spaced 
complex tones with worst relative phase and 20 dB s/n. N is 128. 1/7 is 4000 Hz. 
Corresponding single and double tone bounds also shown. 


Figure 1 illustrates the critical frequency for frequency-estimation 
bounds. The worst phase, i.e., the phase that gives the largest frequency 
estimation bound, was used at each difference frequency. Figure 2 
shows the critical frequency effect upon the frequency-estimation 
bound for a single real tone. 

To facilitate comparisons, in all figures we used a sampling frequency 
of 4000 Hz for complex tones and 8000 Hz for real tones. Thus, in both 
cases, the unknown tone frequencies are assumed to fall in the range 
of 0 to 4000 Hz. 


fil. ESTIMATION ALGORITHMS 


3.1 General 


The Mu estimation procedure is conceptually simple. Given that a 
sample vector, X, is received, the Mu estimate of the parameter vector, 
@, 1s the value of a that maximizes the p.d.f. of X. That is, @ maximizes 
f(&/V). & may not be unique. Maximum likelihood estimation of the 
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parameters of a single complex tone has been shown to be relatively 
easy to implement.! It was shown in Ref. 1 that single complex-tone 
ML estimators have variances almost equal to the c-r bounds over a 
wide range of s/n. No other unbiased estimators could do significantly 
better over that range of s/n. 

Maximum likelihood estimation when several tones are present is 
much more difficult to implement. However, we show below ways to 
approximate ML estimation. We start the discussion with complex 
tones and examine a practical approximation to ML estimation, the 
resulting bias effects, the use of window functions to reduce bias, and 
a time-saving interpolation algorithm. Then we briefly discuss how the 
ideas and results apply to real tones. 

Recall from Ref. 1 that we seek to maximize the function 


2 ] 
L = XL (Xotin + Yorn) — 7 Wh + (46) 


where X, and Y, are as defined in (8) and (4). 
After carefully arranging the terms, we obtain the likelihood func- 
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Fig. 2—Frequency estimation bounds vs frequency for single real tone at 20 dB 
s/n and worst phase. 
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tion in a form similar to that used in Ref. 1: 


LS a {2b; Re [e-**A (w:) | — 03} 


7 = 5 oe ibs, Sees ia ne oe Bs OD) 
44mm n 
where 
1 N=! 
A(w) = W 2 (Xn + 7Vn)e7™r, (48) 


L as given by (47) has two main terms and would be difficult to 
maximize by a simple program. It can be done, but a lot of work is 
involved. We notice, however, that when there is only one tone (k = 1), 
the second term of (47) vanishes. Also, when N is large and k > 1, the 
magnitude of the second term is still relatively small and does not in- 
volve the data. Thus, we are led to drop the second term in Z and 
maximize the remainder. This, of course, will only give mu estimates 
when & = 1 and will give ‘‘almost mu” estimates otherwise. 


3.2 An almost ML algorithm 


Suppose the cross-product terms in (47) are dropped. Then to make 
estimates, we need to maximize 


k 
L, = ¥ 2b; Re [e-**A (w,)] — BF. (49) 


t=1 


From Ref. 1, each frequency estimate, 6; maximizes |A(w)|. Then the 
corresponding level and phase estimates are 


6; = |A(4,)| (50) 
and 
6; = Ee [A (w;) |. (51) 


The function |A(w)| has many maxima and large peaks near the 
frequency of each tone. Thus, the frequencies of these large peaks, as 
illustrated in Fig. 3, are taken to be the frequency estimates, 4;. Due 
to the periodicity of |A(w)|, all the w; should be confined to a range 
no wider than w, = 27/T to avoid ambiguous frequency estimates. 
Normally the range (0, 27/T) is used. When real tones are involved, 
the range should not exceed 7/T. 


3.3 Bias 


Consider the case of only two tones. An example of |A (w) | when the 
noise power is zero is shown on Fig. 3. 
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Fig. 3—Shape of | A (w)| from two complex tones of equal phase and level, without 
noise. N is 16. 


The figure has large peaks near w; and we. The peaks in the example 
are actually both displaced away from the average of the two fre- 
quencies. Thus, the penalty for neglecting the cross-product term in 
(47) 1s a bias in estimates of frequencies and levels. 

The frequency and level bias in the zero-noise case is easily cal- 
culated. An example of such calculations is shown on Fig. 4. The figure 
shows the dependence of frequency estimation bias on the difference 
frequency (Af) of the two tones. When two tones have almost the same 
frequency, the two large peaks merge into one at a frequency equal to 
the average of the two tone frequencies. This accounts for the negative 
slope of —§ at low Af on Fig. 4. There is also a dependence upon the 
difference phase (Aé). 

Figure 4 shows the bias for one of the two complex tones. The bias 
for the other has the same magnitude but opposite sign. In general, 
the magnitudes of the biases for two tones are not equal. However, 
they are equal when the two tones are equal-level complex tones. 


3.4 Window functions 


In discrete Fourier transform (pFT) work, window functions (also 
called weighting functions) are often used to minimize the effects of 
one tone upon another. The modification of the prr of samples of one 
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Fig. 4—Bias in peak frequency of |A (w)| from two pee equal-phase com- 
plex tones vs difference frequency, without noise. N is 6 


tone by the presence of samples of another tone is called leakage. See 
Rife and Vincent" for a discussion of leakage and how window function 
will reduce it. 

In the time domain, a window function (or time window), say A(é), 
is characterized by its samples, h(n7). In use, each data sample, 
Xn +jV¥n = Zn, is multiplied by hn = h(nT) before A (w) is computed. 
Thus, A(w) becomes 


A(w) = ds hind ne ner, (52) 


=| 


When a window function is used, the bias in the frequencies of the peaks 
of | A (w) | is modified. If a good window is used, the bias can be greatly 
reduced. The penalty, as we see below, is an increase in the variance 
of 8; and 6,. Palmer also reported this penalty in Ref. 2. 

In the context of the prt, window functions can be written in the 
form 


ee 5g wes NY: (53) 
t=) 


The number M, which can be assumed to be less than N/2, and the d; 
define particular windows. With h, in this form, 


oe 
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Table |—Values of d; in ascending order of j for 
various windows 


Hanning Standard Taylor 
—] — 1.43596 — 1.03538 
0.497536 0.0824936 

—0.061576 —0.00116197 
—0.00188862 
—0.00123387 
—0.000671595 
—0.000275885 


A window that is better than many at reducing bias is the one 
identified by Rife and Vincent" as g3(t). We call this the standard 
window. Another useful window is one of the Taylor windows.'* These 
windows are defined in Table I. 

Figure 5 is an attempt to summarize the way window functions 
affect bias. The curves on the figure compare upper bounds to the bias 
associated with each of the previously defined window functions. The 
curves were obtained by computing at each frequency the bias at the 
worst phase (the phase that gave the largest bias). The resulting curves 
were flattened as indicated for the Taylor curve. 

Figure 5 shows the Taylor window does the best job when the tone 
frequency separation is small. At large separations, however, the 
standard window does much better. The figure also shows how bad 
the bias is if no window 1s used. 

Windowing increases the variance of frequency and level estimates. 
It can be shown* that the increase in variance is related to the function. 


1 N=-1 
n= a hn. (54) 


It is easy to show that 


M 
n=1+232 da, if N > 2M. (55) 
i=1 
Thus, 7 1s not a function of N. Simulations verify that larger RMs 


errors are associated with larger values of 7. Some values of 7 are 
tabulated below. 


Window n 
None 1.00 
Hanning 1.50 
Taylor 1.54 
Standard 2.16 — 
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Durrani et al. call the sum 7 a dispersion factor.. They have com- 
pared many windows and have tabulated their parameters, including 
dispersion factors. Other windows are mentioned in Blackman and 
Tukey.!® 

The data on Fig. 6 shows the general effects of windows on RMS 
errors when a single complex tone is present. The Hanning window 
produces almost the same RMS error as the Taylor window and is not 
shown on the figure. 

Bias contributes to RMs errors more than variance does at high s/n, 


_-NO WINDOW 


0.4 _-— HANNING 


— TAYLOR 
“ 


0.2 
~ 
f\ 


[f#| IN HERTZ 


0.1 \ 
0.08 


| 
| 
| 
0.06 | 
| _ ACTUAL 
0.04 ja ~~ TAYLOR 
| 
| 


\ 
\ 
\ 
| 
| 
| 


0.02 


0.01 — — STANDARD 
0.008 


0.006 





0.004 
0 100 200 300 400 500 600 


Af IN HERTZ 


Fig. 5—Magnitude of frequency estimation bias for two equal-level complex tones 
using window functions. Curves are leveled as described in the text. Worst-phase was 
used at each frequency. J is 64. 
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Fig. 6—Simulation results showing the effect of window functions on estimation 
variance with a single complex tone. N is 64. 


while estimation variance controls RMS errors at low s/n. Thus, while 
a given window may produce lower RMs errors than another at high 
s/n, the roles may be reversed at low s/n. The “‘best’’ window for a 
given application will, therefore, depend upon the tone frequency 
spacings, the expected s/n, and possibly other factors. Figure 7 illus- 
trates this point. On the figure, the Taylor window is best at 10 dB 
s/n, but the standard window is best at 40 dB s/n, where the bias 
associated with the Taylor window causes the Ras error curve to level 
off. 


3.5 Interpolation 
Maximization of |A(w)| involves a search routine. A two-step 
algorithm that has a coarse search and a fine search was described in 


TONE PARAMETER ESTIMATION 1405 


NO WINDOW 
ra 


TAYLOR 


77 WINDOW 


RMS IN HERTZ 


_ STANDARD 
—~ WINDOW 


| 
C—R BOUND 





0 10 20 30 40 50 
s/n IN dB 


Fig. 7—Simulation results showing combined effects of bias and variance on 
frequency estimation for one of two complex tones. Frequency difference is 380 Hz; 
worst-phase was used for each window. JN is 64. 


Ref. 1. Fine searches are time consuming. This can be serious if com- 
puter time is important. One way to trade accuracy for speed is to use 
an interpolation algorithm on the prv of the input data to arrive at 
frequency and level estimates. 

Rife and Vincent developed several interpolation algorithms." The 
one we investigate here is the following. 

Assume the output of the Frt is the set: 

1 N=! . 
A; = WV pF NL ne a nklN k=OtoN —1. (56) 

Suppose a coarse search is conducted over 0 < k < N. This results in 
locating |A1| which is the largest |A;z| in the interval. Choosea = +1 
such that |Ai2| = |Are| 
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Let 


ay = |Aa| (57) 
and 
ag = | Atte | ‘. (58) 


Assume the sampling frequency is w, = 27/T. 
The formulas from Rife and Vincent are: 


O= 7 (1 + a8) (59) 
and 
hes ee (60) 
snem) E 4 dy8t/(8 — n’)| 
n=l 
where the d, define a window and 
5 — (ide — Crd (61) 


C3d2 + Qi 


The numbers C;, C2, and C3 are given by Rife and Vincent in Table 
II for several windows. 

The interpolation formulas give estimates that are only a little 
worse than the fine search gives. rms frequency errors are typically 
increased by about 30 percent when interpolation is used. Rms level 
errors increase less. 

When many tones are present, window functions can provide a 
satisfactory reduction of leakage as long as the minimum frequency 
separation is no less than about 8r/NT. The data on Fig. 8 illustrate 
this point. The tone phases were all made random for these simulations. 
Thus, the points indicate the Rms errors one might encounter in a 
working system. The bound shown on the figure is the (unbiased) c-R 
bound maximized over the possible phases of the center tone. 

We consider a real-tone estimation system to be equivalent to a 
complex-tone system if the two systems have the same useful band- 
width and the same frequency resolution. This means (2) the real 
sampling frequency is twice the complex sampling frequency and (72) 
the total sampling time, Nt, is the same for the real tones as for the 


Table 1|— Constants for computing delta in eq. (61) 
Window Cy C2 C3 
None 1 0 1 
Hanning 2 1 1 
Taylor 1.96339 1.01643 0.893534 
Standard 3.6020 2.5862 1.0317 
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Fig. 8—Simulation results showing effects of Taylor window and interpolation 
algorithm upon frequency estimates of center tone of three equally spaced real tones. 
Center tone frequency is 2000 Hz. All have random phase. N is 128 and s/n is 20 dB. 


complex. For example, a real-tone system using 1/7’ = 8000 Hz and 
N = 382 1s equivalent to a complex-tone system using 1/7 = 4000 Hz 
and N = 16. 

The estimation algorithms described above for complex tones can 
be applied to real tones whose frequencies, in Hz, are in the range 
(1/NT, 1/2T — 1/NT). The resulting accuracies are about the same 
as In the equivalent complex case. 


IV. CONCLUSIONS 


We have studied the problem of estimating the parameters, such as 
level and frequency, of several sinusoidal signals from a number of 
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noisy observations, taken at discrete-time instants. Gaussian noise and 
ideal analog-to-digital conversion were assumed. The nature of the 
problem led us to study the generalized Cramér-Rao lower bounds to 
estimation accuracy and maximum likelihood estimation. The com- 
plexity of maximum likelihood estimation algorithms led us to examine 
several algorithms that yield estimates that are almost, but not 
exactly, maximum likelihood estimates of the signal parameters. 

We were able to obtain estimators that have negligible bias, at least 
at high s/n. Thus, we considered in detail only the generalized c-rR 
bound for unbiased estimators. Even when the resulting numbers are 
not, strictly speaking, lower bounds (e.g., when an estimator is biased), 
the unbiased estimation bounds can be considered to be desirable 
objectives for estimators. 

Several properties of the bounds were derived from the properties 
of the J matrix. Other properties, such as the existence of critical 
frequencies, were revealed from computations. 

The J matrix in the real tone cases is more complicated than in the 
complex cases and does not have quite the same structural properties. 
Thus, for example, the lower bounds for a single complex tone are not 
also lower bounds for the equivalent single real tone. On the other 
hand, the bounds for the case of many real tones approach the bounds 
for the equivalent complex cases when none of the real tones have 
frequency differences less than 2/NT, modulo 1/27’ (in Hz). 

The cases of many complex tones and of real tones present some 
difficulties. Maximum likelihood estimation is difficult to implement 
because of the presence of cross-product terms. To properly implement 
ML estimation, multidimensional search procedures over a nonconvex 
function would be necessary. We found that when the tone frequencies 
are separated far enough, the cross-product terms could be neglected, 
thereby permitting the use of a simple algorithm whose estimates are 
almost equal to ML estimates. a 

The penalty for dropping the cross-product terms is a bias in fre- 
quency and level estimates. We found that the use of a suitable window 
function will reduce the bias to the point where it can be neglected 
when the minimum frequency separation of the tones is 4/NT7. Three 
window functions were discussed and compared. 

We found that the use of a window to reduce bias increased the 
variance of the frequency and level estimates. The ras error of fre- 
quency estimates is increased by about 35 percent with Taylor window 
and by over 100 percent with standard window. The use of the in- 
terpolation formulas increases RMS frequency errors by another 30 per- 
cent or so. Level estimates are affected less by windows and interpola- 
tion. All of these figures apply when the s/n is above threshold. 
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On the Rearrangeability of Some Multistage 


Connecting Networks 


By F. K. HWANG 
(Manuscript received July 24, 1975) 


We generalize the concept of rearrangeability to a finer measure of the 
connecting power of a network, called the c-rearrangeability function. It 
can be interpreted as the proportion of calls a network guarantees to 
connect under a given traffic load. We study the c-rearrangeability function 
for many well-known rearrangeable networks, including one-sided rear- 
rangeable, two-sided rearrangeable, as well as several other kinds. We also 
give constructions for some new classes of networks, study their c-rear- 
rangeability functions, and describe conditions under which the networks 
are rearrangeable. We show that these newly constructed rearrangeable net- 
works compare favorably with the well-known ones with respect to the 
number of crosspoints. 


l. INTRODUCTION 


A multistage connecting network can be described by the following 
(see Fig. 1 for a three-stage example): 


(2) 


(22) 


(271) 


There are s ordered stages, where s > 1 is arbitrary. The 7th 
stage, 7 = 1, ---, s, consists of r; copies of a switch v;. The jth 
copy of v; 1s denoted by »,;;. 

Links can exist only between switches of adjacent stages or 
between vi(y,) and input (output) terminals of the network. 
The set of links incident to a particular »; 1s partitioned into 
two subsets. Those which are linked to either »;_; or input 
terminals are called input links of v;, and those linked to either 
vit, or output terminals are called output links. 

The 7; copies of »; are called input switches of the network. 
Each y; 1s connected to n; input terminals. The r, copies of 
vy, are called output switches of the network. Each », is con- 
nected to n, output terminals. 


The three-stage Clos network is a special case of a multistage con- 
necting network, satisfying the additional restrictions that s = 3 and 
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STAGE 1 STAGE 2 STAGE 3 
Fig. 1—Generalized three-step Clos network. 


that there is exactly one link between every pair (v1, v2) and every 
pair (ve, v3). When 7, ve, v3 themselves are allowed to be multistage 
connecting networks, then a three-stage Clos network is called a 
generalized Clos network. For simplicity, we assume n,; = n3 = n 
throughout this paper. Then, a generalized Clos network can be 
denoted by C'(v1, v2, v3, 71, 72, 73, 2) (see Fig. 1). 

Define a request to be a pair of idle terminals seeking connection. 
A request becomes a call once the two terminals are connected in the 
network. An assignment is a.set of requests and the size of an assign- 
ment is the number of requests in it. An assignment Is said to be realiz- 
able if every request in it can be simultaneously connected in the 
network without any link being used more than once. A network 1s 
said to be rearrangeable if it can realize every possible assignment. 

Consider a multistage connecting network v. Let g be the set of 
input terminals, © the set of output terminals of vy, and J + © = JT. In 
many actual cases, not every: possible pair in 7 will generate a request. 
In general, there could be two subsets J, Q € T' such that all requests 
are generated in the product space J X Q. However, the four most 
important cases are: 


(2) the one-sided case: = Q = T. 

(21) the two-sided case: [ = 9,2 = 0. 
(412) the input-mixed case: / = 9,Q = T. 
(iv) the output-mixed case: J = T, 2 = 0. 


The last two are often combined and called the mixed case. 

A network is said to be one-sided rearrangeable if it can realize 
every one-sided assignment. Similarly, we can define two-sided re- 
arrangeable, input-mixed rearrangeable, and output-mixed rearrange- 
able. Thus, a one-sided rearrangeable network means that every set 
of pairs of terminals can be simultaneously connected, and a two-sided 
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rearrangeable network means that every set of (input link—-output 
link) terminals can be simultaneously connected. It is clear that one- 
sided rearrangeability implies mixed rearrangeability, which, in turn, 
implies two-sided rearrangeability. 

Rearrangeability is a strong condition which is manifested in two 
aspects. First, all the requests In an assignment must be simultaneously 
connected ; 1.e., if one request fails, the whole assignment fails. Second, 
every assignment must be realizable; 1.e., if one assignment fails, the 
whole network fails. Even for a nonrearrangeable network, it is still 
of interest to know the degree of its nonrearrangeability. We introduce 
a new concept of rearrangeability in this direction. First, we score an 
assignment by the largest number of requests it can guarantee to 
connect simultaneously. Second, we partition the set of all assignments 
into classes according to the size of an assignment. We score a class by 
the lowest score achieved by any member in this class. Now, a bad 
assignment can still bring down the score of its class, but not of the 
other classes, and not to a score of zero. Thus, we define R,(c), the 
c-rearrangeability function, as the largest number of requests the 
network v can guarantee to connect given any assignment of size c. 
Thus, R,(c)/c is the proportion of requests vy can guarantee to connect 
given that the traffic load is approximately c/(capacity of v). When 
R,(c) = c, we say v is c-rearrangeable. If v is c-rearrangeable for all c, 
then ¢ is rearrangeable in the classical sense. 

In this paper, we study the c-rearrangeability functions for some 
well-known rearrangeable networks. We also construct some new 
classes of networks, study their c-rearrangeability functions, and de- 
scribe conditions under which the networks are rearrangeable. We show 
that these newly constructed rearrangeable networks can save a sig- 
nificant number of crosspoints over the well-known networks. 


ll. ANALYSES OF SOME WELL-KNOWN REARRANGEABLE NETWORKS 


As switches are the basic components of a network, to understand 
the rearrangeable property of a network, we have first to know what 
the rearrangeable properties of its switches are (the switches mentioned 
in this paper are all cross-point grid switches). For a switch, the 
definition of rearrangeability is similar to that for networks, except 
that input links and output links replace the roles of input terminals 
and output terminals in a network. 

Two links of a switch have direct access to each other if they inter- 
sect at a crosspoint. In many networks, the cost of crosspoints still 
dominates the other costs. Therefore, we would like to minimize the 
number of crosspoints in a network. A relevant question is, for a given 
rearrangeable property, which switch has the minimum number of 
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crosspoints? This problem has recently been solved by Chung.! How- 
ever, In our networks, we will stick to the more traditional switches 
for their engineering feasibility and for ease of comparisons with 
existing networks. Consider a switch with n input links and m output 
links. It is called a triangular switch if there is a crosspoint between 
every pair of links, input or output. Therefore, a triangular switch 
has [(n + m)(n + m — 1)]/2 crosspoints and is clearly one-sided 
rearrangeable. The switch is called a rectangular switch if there is a 
crosspoint between every input link and every output link. 

A rectangular switch has n X m crosspoints and is two-sided re- 
arrangeable. The switch is called a trapezoidal switch if there is a 
crosspoint between every pair of links with at least one of the links be- 
longing to a fixed side. A trapezoidal switch either has n(n — 1)/2 + nm 
or m(m — 1)/2 + nm crosspoints and is either input-mixed or output- 
mixed rearrangeable, depending on which side is the fixed side. Note 
that an n X m rectangular switch is in fact input-mixed rearrangeable 
ifm 2 n — 1. This is because any pair of input links can be connected 
through an output link, and there are always enough output links to 
do it. While the existing networks always use trapezoidal switches 
when mixed-rearrangeable switches are needed, we will use rectangular 
switches to save crosspoints when the condition m 2 n — 1 for input- 
mixed and n = m — 1 for output-mixed is met. 

In every network we discuss in this paper, v1; and », are always 
assumed to be two-sided rearrangeable (or stronger). Hence, two 
terminals from two distinct »; and/or pv, can be connected if and only 
if their corresponding switches »:(y;) can be connected. Thus, we can 
redefine a request as a pair of v1 and/or vy, and an assignment as a 
collection of requests where each »; or v,; can appear at most n times. 
If a request 1s (v1;, viz) Or (v5;, vsj), then we have to discuss separately 
how they can be connected. 

Consider vy = C(v,, v2, v3, 11, 12; 3, N) Shown in Fig. 2. (In our figures, 
A\ represents a one-sided rearrangeable v;, (| a two-sided rearrangeable 





Fig. 2—Ordinary three-stage Clos network. 
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vi, ZL] an input-mixed rearrangeable v,;, and LX an output-mixed 
rearrangeable v;. The number of input links and output links will be 
shown inside these figures.) 


Theorem 1: (See Slepian,? Duguid,’ and BeneS.*) v 2s two-sided rearrange- 
able 1f and only af re = n. 


Theorem 2: 


min {ro/n, 1}, for c>1re; 


R,(c)/c =1, for c Sfo. 


Proof: Actually, we will prove an exact expression for R,(c)/c. We need 
only consider the case n > re and ¢ > re since the other cases are 
trivial. Consider any assignment of size c. Let c = pn +q where 
0 Sq <n. We can assume that the re real ve are embedded in a set 
of n Imaginary ve. Then by Theorem 1, all requests can be simul- 
taneously connected by the n vz. Rank the n v2 according to the number 
of calls they carry and select the re v2 with the highest ranks to be the 
re real ones. They must carry a total number of calls not less than 
min {re, g} X (p + 1) + max {re — gq, 0} X p. On the other hand, 
when all the requests in an assignment involve only a few v1, say as 
few as possible, then every v2 carries essentially the same number of 
calls, differing at most by one. Hence, 


_jrp+g/(npt+g), wore 2 q; 
eee (rep + 72)/(np+q), lf r<4¥. 


Theorem 2 gives a good approximation to this when ¢ and rep are 
large relative to q. 

To compute the number of crosspoints of a network, we always 
make the simplifying assumptions that r; = r; = n and all v; are 
nonblocking switches so that we can easily compare the various net- 
works. Under these assumptions, then, the current network for rz = n 
has 373 crosspoints. 

Next consider vg = C'(y1, v2, v3, 71, 72) 73, 2) Shown in Fig. 3a. 


Theorem 3:°-" y ts one-sided rearrangeable if and only if re 2 |.8n/2 4], 
where |x| 1s, as usual, the integer part of x. 


Theorem 4: 
. 3n 
~ min —|,1l+, f > £23 
R,(c)/c {r3/| 5 | or c>fe 


=A; for cS fo. 


Proof: Again, we need only consider the case [3n/2] > re and c > ro. 
The proof that R,(c)/c = r2/|8n/2| uses a similar argument to that 
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Fig. 3—Mixed three-stage Clos networks. 


in the proof of Theorem 2. To prove the reverse inequality, label all 
the v; and v3 by the numbers | tor; + 73. Consider the | 8n/2 | requests, 
(1, 2); ng Cl; 2); (Ln/2 ] of them), 
(1, 3), re (1, 3), (Ln/2] of them), 
(2,3), °°, (2,3), (L(+ 1)/2] of them). 
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No two of them can be carried by the same ve, since otherwise the 
two will share a link. Consider an assignment whose requests can be 
partitioned into sets of | 3n/2]| requests of the above type and a re- 
mainder set which is a subset of the above type of |3n/2_] requests. 
Then the number of calls carried by each ve can differ at most by one. 
Hence, R,(c)/c S re/|.3n/2| + (a constant)/c & r2/[3n/2 |. The proof 
is completed. 


Under the simplifying assumptions previously stated, the number 
of crosspoints for this network for re = [8n/2_] is 7n’. Note that this 
compares favorably with the one-sided network’ obtained from a two- 
sided rearrangeable network where both sides are 9 + 0. Such a 
network needs 12n’ crosspoints. It is also better than the one obtained 
by joining three two-sided networks together, the first having 9g in 
both sides, the second having 0 in both sides, and the third having 9 | 
in one side and © the other. Such a network needs 9n* crosspoints. 

We can easily obtain an input-mixed rearrangeable network from 
the above one-sided rearrangeable network by changing v2 from one- 
sided rearrangeable to input-mixed rearrangeable and v3 from output- 
_ mixed rearrangeable to two-sided rearrangeable (Fig. 3b). Let v be the 
network shown in Fig. 3b. 


Theorem 8: 


. 3n 
~ min jr — |,1l}+, for c>?., 
R,(e)/ce | :/| 2 ; 


= J, for cSfs. 


Proof: The proof is similar to the proofs for Theorems 3 and 4. 

For re = 3n/2, this network has (23/4)n? crosspoints. However, we 
can also obtain an input-mixed rearrangeable network by joining two 
two-sided rearrangeable networks together; one is (J, ©)-two-sided 
and the other has J in both sides. Such a network needs 6n' crosspoints. 


Hl. A NEW INPUT-MIXED REARRANGEABLE NETWORK 


Consider v = C(v4, v2, v3, 71, 2, 73, 2) Shown in Fig. 3c. 
Theorem 6: v is nput-mixed rearrangeable if 


(2) re 2 N; 


— 


(27) (re — 1)r3 = nr. 


Proof: We first explain how a request is connected in this network. A 
(viz, vaj) request is still connected through some v2 which has an idle 
link to vi; and an idle link to v3; just as is done in the networks of 
Section II. But a (v1:, »1;) request cannot be connected in this manner 
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since v2 cannot connect two input links. Instead, we will connect both 
vi; and v1; to some v3, and then use the input-mixed rearrangeable 
property of v3 to complete the connection. One question is whether 
there are enough input links of v3 to accommodate all (v1, v1) requests. 
Now each (v1, v3) request takes up one input link of »; and one of ps, 
and each (vy, v1) request takes up two input links of v; and two of 3. 
Hence, regardless of the distribution of (v1, v1) requests relative to the 
(v1, v3) requests, the maximum number of v3 input links needed is, 
except for a minor correction, the maximum number of »; input links 
available, which is nr1. The minor correction is because each (71, v3) 
request takes up a pair of v3 input links from the same v3. Hence, 
occasionally, a v3 input link may be wasted since it has no partner. 
Discounting one input link from each »3, we obtain condition (72). 

If condition (22) is satisfied, then each (v1:, v1;) request can be re- 
placed by two requests (71;, v3.) and (¥1;, vsx). Hence, an input assign- 
ment is turned into a two-sided assignment. The requirement that each 
v3, Yaust appear no more than n times is irrelevant here because the 
connection of (v1;, v3,) does not involve any output links of v3,. By 
Theorem 1, the derived two-sided assignment is rearrangeable if re 2 n. 
Theorem 6 is proved. 


If r; = 73, then re = n + 1 satisfies both conditions of Theorem 6. 
Furthermore, since the size is right, we can use rectangular switches 
for v3 for mixed-rearrangeable property. This network has 3n* + 3n? 
crosspoints (under the simplifying assumptions) as compared to 
(23/4)n3 for the input-mixed rearrangeable network in Section II. 

Since a (v1; v1;) request takes twice as many links to connect as a 
(viz, v3;) request, one might suspect that the blocking probability for 
the former request is much larger. This is not necessarily true, however, 
since there are only re distinct connecting paths of two links for a 
(viz, v33) request but (¥)r3 paths of four links for a (vy; »1;) request. 


Theorem 7: Let v be the network in Theorem 6. Then 


max {2c — nr1,0} + (re — Lr r 


Hehe r 
R,(c)/c = min h, 3 x min fe 5c 


Proof: For the time being, suppose 72 = n. Consider any assignment 
of size c and let w be the number of (1, v3)-type requests in it. Then 
u = max {2c — nr,, 0} since u + 2(c — u) = 2c — u input links of 1 
are required while only nr, are available. If there are not enough input 
links of v3 to take care of all c requests, then priority should be given 
to (v1, v3) type requests to maximize the number of requests connected. 
The priority is due to the fact that a (v;, v3) request needs only one 
input link of v3 while a (71, v1) request needs two. For u 2 fers, the 


1418 THE BELL SYSTEM TECHNICAL JOURNAL, NOVEMBER 1976 


maximum number of requests connectable is rer3; for u < rer3, the 
maximum is approximately 


(ro — 1)r3 — u OS (ro — 1)rs3 
2 Z 


The worst case occurs when wu is at its minimum, L.e., w= max 
xX {2c — nr, 0}. But still the network guarantees to connect at least 
4 [max (2c — nri, O} 4+ (re — 1)rs] requests. Now look at the dis- 
tribution of these calls in the ve. If 72 < n, select the reve that carry 
the most calls. In this way, we obtain Theorem 7. 


U + 


< To?3. 


Corollary: v ts input-mixed c-rearrangeable if rz = n and either 


(ro — 1)r3 
2 


{ro — Ura, 141, 
z = 9 


IA 
IV 


c or -¢ 
IV. A NEW ONE-SIDED REARRANGEABLE NETWORK 

Consider »v = C(n1, v2, v3, 11, 12, 73, 2), Where vy; and v3 are input- 
and output-mixed rearrangeable and v2 is one-sided rearrangeable. 
Also assume n and rz are even. We construct a v’ from »v by inserting 
something between the pair (v2,2:-1, v2,2:) foreach z = 1, -+-, 72/2 (see 
Fig. 4) to provide some limited access between links of ve,2;-1 and links 
of vo,2;, One way to do this is to insert two two-sided rearrangeable 
networks yw; and pws between v2.1 and ve,e;. The input links of pis 
are the extensions of the n links of ve,o;-1 and the output links of pie 
are the extensions of the n links of vo,2;. wi: has 4(7r1 + rs) output links 
that become the input links of yi2. Thus any link of v2,2;-1 can seize 
an output link of ui and then connect to any link of v2,o; In pie. OF 
course, }(r; + 73) such connections can be made simultaneously. 





Fig. 4—One-sided rearrangeable networks. 
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Theorem 8: v 1s one-sided rearrangeable if and only tf r2 = n. 


Proof: For a given assignment, define an assignment graph by taking 
all vy; and v3 as vertices and every request as an edge. We can augment 
the assignment graph to become a regular graph of degree n by adding 
suitable edges to it. By a theorem of Petersen (see Refs. 9 or 10), a 
regular graph of even degree is 2-factorable; i.e., the assignment graph 
can be decomposed into (n/2) 2-factors, where a 2-factor is a subgraph 
in which every vertex is of degree 2. Hence, a 2-factor consists of a set 
of disjoint circuits. Now any circuit of length 1 represents a request 
from two terminals of the same switch. We can connect them within 
that switch because of its mixed-rearrangeable property. Aside from 
that, we can partition all edges in an odd circuit into three sets such 
that edges in the same set are all disjoint; and we can partition all 
edges in an even circuit into two such sets. Since all circuits in a 2-factor 
are disjoint, we can combine those sets into three large sets A;, A2, A3, 
such that the edges in each large set are all disjoint. As each edge 
represents a request, all the requests in A; can be connected through, 
Say, ¥e1, since they are all disjoint (we can ignore those edges which 
are augmented to the assignment graph). Similarly, all the requests 
in Az can be connected through ve. For a request in As, say (a, y), if 
(x, y) is disjoint with every request in A;(A2), then we can connect 
it through v1(v22). Otherwise, suppose x has appeared in A, and y 
in Ae. Then we connect x to vez and y to v2: and then connect them 
through wi: and wie. We do this for every request in A;. Therefore, all 
requests in a 2-factor can be connected by a pair of ve. There are n/2 
2-factors; hence, n/2 pairs of v2 will suffice. If we have less than n/2 
pairs of v2, then there is no way to handle the 3n/2 requests given in 
the proof of Theorem 4. Hence, Theorem 8 is proved. 


Theorem 9: 


=min| 2,1}, for ¢ > 22, 
R,(c)/c ; 
=|, for cat 


Proof: Omitted. 

For re = n, this network has (16/3)n® crosspoints versus the 7n3 
for the standard one-sided rearrangeable network. 
V. A c-REARRANGEABILITY THEOREM 


We have seen that the c-rearrangeability functions of many net- 
works discussed in previous sections are such that R,(c)/c=&a, a 
constant, over most of the range of c. Consider »v = C(11, v2, v3, 11, 
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re, 3, 2) such that R,,(c)/c & a; for two-sided assignment. What can 
we say about F,(c) for two-sided assignment? If the blocking in », 
ve, v3 and the blocking due to the structure of v all act independently, 
then we should have 


i) 


Ble) = min’; Lo —— 
r(c) SS ararsees ’ max fay, a3} X 2 


The reason for the last term is because at most max {a, a3} X n 
requests from the same switch can get through to the second stage. 
However, we show that the blocking in »; and the blocking in »3 can 
be coordinated so that the requests blocked in one stage form a subset 
of the requests blocked in the other stage. Without loss of generality, 
suppose a; S a3. Then 


Theorem 10: 
R,(c) = A1Q1 min H, a ‘ 


ain 
Proof: For any given assignment, consider its bipartite assignment 
graph G. Let d; be the degree of vertex 7. We want to find a subgraph 
G’ such that the degree of vertex v in G’ is d; = aid; (treating it as an 
integer). Then G’ is the set of requests that will get through both » 
and V3. 

Let V; be the set of vertices corresponding to »; and V3; the set 
corresponding to v3. Let d(X) denote the sum of degrees over all 
vertices of X in G, and define ad’(X) similarly for X in G’. Finally, let 
dy(X) denote the degree sum of X in G when the set Y is deleted from 
G. Then a theorem of Gale" on network flows has the following 
interpretation.” 


Gale’s Theorem: G’ exists if and only if there do not exist two sets S C Vi, 
T C V3 such that either 


d’(S) > d(T) + dr(S), 
or 


d(T) > d'(S) + ds(T). 


In our case, d’(S) = aid(S) and d’(T) = aid(T). Without loss of 
generality, suppose d(S) 2 d(T). Then the second inequality in Gale’s 
theorem certainly cannot hold. To check the first, note that 


dr(S) 2 d(S) — d(T) 2 aid(S) — d(T) = d'(S) — a’(T). 


Hence, the first inequality also does not hold. We conclude that G’ 
exists and Theorem 10 is proved. 


When the involved numbers are large, the discrepancy caused by 
assuming a,d; an integer is certainly negligible. 
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A fixed-point algorithm has been used to obtain the parameters (1.e., 
decision and representative levels) of an “‘optimum’” quantizer that mini- 
mizes a quite general distortion measure, subject to an entropy constraint 
on its output. Construction of the algorithm starts with a povnt-to-set 
mapping whose fixed point satisfies the well-known Karush-Kuhn-Tucker 
conditions necessary for a local extremum. A computer program is then 
used to determine a fixed point of this mapping. Several examples are 
solved, and correspondence with the existing results in the literature is 
pointed out. Finally, as conjectured, the growth of the computations as a 
function of dimensionality n (n: number of representative levels) 1s found 
to be of the form a-n® where a is a positive constant and 1.5 = 6 S 2.0. 


I. INTRODUCTION 


Simple quantization'~* has been and continues to be a popular 
method of digitizing analog signals. The relative ease with which 
quantizers can be implemented in hardware and their near optimum 
performance has made them withstand the challenge from several 
new coding schemes.*-® Universal use of quantizers has naturally 
spurred a significant activity in optimizing their performance, some 
of which is summarized in the next few paragraphs. Our objective 
in this paper is to show how the problem of obtaining the parameters 
of an optimum quantizer can be converted to the problem of obtaining 
fixed points of a suitably constructed mapping and then to use a 
fixed-point algorithm to solve the problem numerically. 

Quantizers have been optimized based on several criteria. In order 
to discuss these in relation to the problem considered in this paper, we 
describe the basic quantizer equations. Given a scalar random variable 
T with probability density p(t), a quantizer Q is a map Q(t) = y: 
whenever 2; S ¢ < 241, where z;, 7 = 1, ---, N+ 1 and y;, 7 = 1, 
-++, N are the decision and representative levels of the quantizer, 
respectively. The performance of the quantizer is judged generally in 
terms of two quantities: 

1423 


the distortion 


N Lit 
D= p> gt — ys) X fOdt, (1) 
and the entropy 
N 
6é=— Pz (loge pi) X pi, (2) 


where g is a nonnegative function and f is a nonnegative weighting 
function that weights the quantization noise and 


Lit 
Di = a p(tyadt. 


Optimum quantizers choose their parameters {xv,;}, 7 = 2, ---, N and 
{yi},7 = 1, ---, N (given the end points x1, xy+1) to optimize a certain 
combination of D and 6. 

Most quantization literature uses the weighting function f to be the 
same as the probability function p, although in some applications’® a 
different weighting function performs better. Most of the earlier work 
is concerned with minimizing D for a given number of levels. Panter 
and Dite” have used g(-) = |(-)|*(r > 0) and obtained an approxi- 
mate optimum quantizer as one in which each of the quantizing in- 
tervals [x., 2:1.) makes an equal contribution to the integral of 
|(¢ — y.)|*. This allowed them to choose the quantizer parameters for 
large N. Lloyd" and Max” have developed an algorithm for r = 2, 
which corresponds to minimizing the mean square error. Bruce!’ has 
used dynamic programming to solve the same problem in slightly more 
generality by taking a general function g(-). Simpler suboptimal algo- 
rithms and bounds on the performance of the quantizers have been 
obtained by Roe,!* Algazi,!® and Zador.'® 

Representation of the quantizer output by a variable length code 
allows reduction of the average bit rate of the quantizer when 7; varies 
with 7. Use of Huffman code!” makes the average bit rate approach 
the entropy of the quantizer output. Thus, the problem of designing 
an optimum!*-!9 quantizer can be reformulated as that of obtaining the 
decision and representative levels to minimize D subject to a constraint 
on the entropy. Goblick and Holsinger have considered this problem 
for uniform quantizers and have concluded that for gaussian density, 
for r = 2, and for the same distortion, the entropy of the output of the 
uniform quantizer is higher than the theoretical lower bound based on 
the rate distortion theory by about 4 bit. Uniform quantizers are also 
good in an asymptotic sense, since they are optimum for a large number 
of levels.*4 Moreover, for Laplacian densities, as shown by Berger,” 
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uniform quantizers are optimum for any value of entropy. A different 
type of distortion measure has been considered by Elias.” 

The problem we consider is that of obtaining the parameters of 
quantizers such that D is minimized for a given constraint on the 
entropy. Although the approach taken here is suitable for a general 
distortion measure of eq. (1), we consider only the case of g(-) = (-)?, 
mainly to compare our results to those in the literature. In the next 
section, we present the necessary conditions that the optimum quan- 
tizer must satisfy for a local extremum. Then, in Section III, we con- 
struct a point-to-set mapping such that its fixed point satisfies the 
necessary conditions for a local extremum of our problem. A descrip- 
tion of the algorithm is then presented for completeness. In Section IV, 
we present the results of use of this algorithm for uniform, Laplacian, 
and gaussian densities. The distortion-entropy curves are presented for 
each case. We also present a surprising observation on the growth of 
computations as a function of dimensionality (i.e., the number of 
quantizer parameters to be optimized). 


ll. FORMULATION OF THE PROBLEM AND NECESSARY CONDITIONS 
Using g(-) = (-)?, the distortion of eq. (1) becomes 


N "LiZ+1 
D= > [b= wf@at 3) 
j=l Ja; 
Then the problem is to obtain {2;}, 7 = 2, ---, N, {yj}, 7=1,:°°°,N 


such that they minimize D subject to 6 S K, for a given N. The 
necessary conditions from the Karush-Kuhn-Tucker theory’ are that 
there exists a A 2 O such that 


VD(z) + \V&(z) = 0, (4) 


where z Is a vector of quantizer parameters and V denotes the gradient. 
For the parameters {y;}, since & is independent of {y;}, (4) becomes 


w= f ” spar | f ” (Od, j=leyN 6) 


This implies that the representative levels can be obtained explicitly 

by knowing the decision levels and therefore they do not add to the 

dimensionality of the problem. Also, the other necessary conditions are 
6SK 

and 


\(6 — K) = 0. (6) 


lll. FIXED-POINT APPROACH 


In this section, we formulate the quantization problem as a fixed- 
point problem and give a general description of the algorithm that 
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solves this problem. This algorithm is based on the theory of com- 
plementary pivoting.*4 

Given a point-to-set mapping I [i.e., to each point x in R” it as- 
sociates a subset I'(x) of R”], a fixed point of such a mapping is a point 
x such that x € I'(a). We show that the problem of finding the pa- 
rameters of the optimal quantizer can be formulated as a problem of 
finding a fixed point of a certain point-to-set mapping. 


3.1 Fixed-point formulation 


Let VD and V6 be the gradient vectors of the distortion D and 
entropy 6, respectively. Then, consider the following point-to-set 
mapping: 


xz — {VD(x)} E(a) << K 
T(x) = + x — hull {VD(z), VE(a)} E(x) = K, (7) 
a — {VE&(z)} &(z) > K 


where hull {£} is the smallest convex set containing /; 1.e., the convex 
hull of H, anda — A = {x — y:y € A} for a set A in R*. Note that 
the mapping as defined is upper semicontinuous* (u.s.c.) and the set 
I'(x) is convex for each x. As we subsequently see, these properties are 
needed if the algorithm is to find a fixed point of I. 

We now show that a fixed point of this mapping satisfies the necessary 
conditions of Section 2. 


Theorem: Let «x © T(x). Then, if &(2) S K, x satisfies the necessary 
conditions of Section 2. Otherwise, x is a local minimizer of &(2x). 


Proof: We construct the required \ and show that (6) is satisfied. 
Since x © I'(xz) and &(x) S K, we have two cases: 


Case (1): 6(a) < K. Let } = 0 and, since 0€ {VD(zaz)}, VD(z) 
+ AV&E(x) = 0, satisfying (6). Note that A 6(2) — K] = 0. 


Case (17): &(a) = K. Then, as 0 € hull {VD(z), V&(x)}, there exist 
At + Ae = 1, Ay = 0, de = 0 such that 


\MVD (x) + eVE(x) = 0. (8) 


Now, in case \; # O, letting \ = d2/A1 2 0, (4) is satisfied, and 
AL 6(x) — K] = 0. In the contrary case, a constraint qualification 
would be violated. 

In case x © I(x) and &(x) > K, then, since 0 € {V&(az)}, VE(x) 
= 0 and we have a local minimizer of 6(x2). If &(@) were a convex 
function, our problem has no feasible solution [i.e., an x such that 


“A mapping IL is u.s.c. if, for any two sequences {x*}, {y*} such that z* > 2, 
y* C T(x*), and y* > y, we have y € T(z). 
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&(x) < K ]. In the contrary case, we would conclude the algorithm 
has failed. 


3.2 Description of the algorithm 


In this section we give a brief description of the algorithm that com- 
putes fixed points of point-to-set mappings. Before going into the 
details of the algorithm, we introduce some notation. 

Given a set C in R”, and a point-to-set mapping I, by I'(C) we 


represent the set U I(x). Also, given a one-to-one linear mapping 7, 
rec 
we say a Set Cis 


(1) T—complete if 0 € hull {T(C)}, 
(it) r—complete if 0 € hull {r(C)}, and 
(az) I U r—complete if 0 € hull {T(C) Ur(C)}. 


The significance of IT'-complete sets is the following: in case I is 
u.s.c. and I(x) is convex for each x, a sequence C;, 7 = 1, 2, --- of 
I'-complete sets whose diameter approaches 0 as 7 approaches ©, con- 
verges to a fixed point of I (see, for example, Refs. 25-27). The fixed- 
point algorithms are designed to find such a sequence of I'-complete 
sets. 

These algorithms work with sets C that are simplexes of appropriate 
dimension. (An n-dimensional simplex is a convex body obtained by 
taking the convex hull of n + 1 affinely independent points in n-space. 
A two-dimensional simplex is a triangle; a three-dimensional simplex 
is a tetrahedron.) They start with a unique r-complete simplex and 
generate a sequence of I LU r-complete simplexes that terminate with 
a T'-complete simplex. There are essentially two basic algorithms that 
can be used to generate a sequence of '-complete simplexes of decreas- 
ing diameters. They are the restart method of Merrill?’ and the con- 
tinuous deformation method of Eaves and Saigal.?® A study of both these 
methods can be found in Saigal.?8—%? 

We now discuss an application of the algorithm. A real number 
d > 0 is chosen. Then the space R” X [0, d] is triangulated (1.e., each 
point in the space lies in an (n + 1)-dimensional simplex, and these 
simplexes overlap only on their boundaries) such that the vertices of 
the triangulation are only in the set R” X {d/2*}, k = 0,1, -:-. In 
addition, the diameter of each n-dimensional face of each (n + 1)- 
dimensional simplex that lies in R” X [d/2*t+, d/2*] is at most d/2*. 
Now, an arbitrary starting point x is chosen. We then define 

r(x) = —£ + 2o, (9) 
which is a one-to-one linear mapping. 

The sequence of IT U r-complete simplexes is then generated as 
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follows: 


Step 1: Start with an r-complete simplex in the triangulation that 
contains (2, d). The triangulation is arranged in such a way 
that there is a unique such simplex, and that this simplex 
has exactly one vertex in R” X {d/2}, and (n + 1) vertices 
in R” X {d}. The entering vertex is the one in R” X {d/2}. 
Design the labeling function Z on the vertices of the triangu- 
lation with 


Z2—2 forsomez E€ T(x) if t<d 
70 — > if t=d 
Step 2: Find the label on the entering vertex. 

Step 8: Find a new I U r-complete simplex that includes the enter- 
ing vertex, in place of some vertex of the older simplex. 
This is equivalent to the basic pivot operation of the simplex 
method. 

Step 4: Find the other (n + 1)-dimensional simplex that contains 
the new I U r-complete simplex found in Step 3, and de- 
termine the entering vertex. 

Step 5: If the entering vertex is outside R” X {d/2*, d}, stop. The 
earlier IU r-complete simplex is actually I-complete. 
Otherwise, go to Step 3. 


L(a, t) = (10) 


Having found a I'-complete simplex 7, say, whose vertices are V', 


V2, «++, V=t!, where V* = (v*, di), 7 = 1, ---, n +1, we have de- 
termined points z? € ['(v‘) and aA = (Ay, °--, Anzi) 2 O such that 
n+l ; 
a Az? = 0 
a (11) 
n+l 
DA = 1 


i=l 
has a solution. In this case, we say that the point x determined by 


n+1 
t= 2X div" (12) 
is an approximate fixed point (for justification, see Ref. 26). 

Since the stopping criterion at Step 5 requires that we generate a 
vertex In R”™ X {d/2*}, we have generated a sequence of I'-complete 
sets C,, the last one of diameter less than d/2*, and have thus found a 
reasonable solution. 

The procedure for triangulation R” X (0, d] generally used is 
called J3 in the literature. For a more detailed description of this 
algorithm, the reader is referred to Ref. 32. 
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IV. EXAMPLES 


In this section, we discuss some examples that we solved using the 
algorithm described in the previous section. Three of the four examples 
had f(-) = p(-) corresponding to mean square quantization error as 
a measure of distortion. The fourth example, on the other hand, uses 
a different weighting of the quantization noise; it is motivated by the 
problem of quantizer design for simple element differential coding of 
picture signals. The examples are: 


(z) f(x) = p(x) = ax; —16S5 25 +16 
= 0 otherwise 
(iz) f(x) = p(x) = * gate, —x <zr<+un0,a=0.1 
(iit) f(x) = p(z) = eee —xo<r<+o,a=1 (13) 
Ta 


(iv) fe) = ZeP; play = Leatl, 0 <a< te 
a=0.18,6=01; and a =0.1,6 = 0.065. 


Due to symmetry of functions f(-) and p(-), the optimum quantizers 
are symmetric and, for simplicity therefore, quantizers were con- 


MEAN SQUARE ERROR 





“1.0 15 2.0 2.5 3.0 3.5 4.0 
ENTROPY (BITS) 


Fig. 1—Quantizer performance for uniform density. Minimum mean square error 
(MMSE) is plotted against entropy for a fixed number of levels (NV). Only odd-level 
quantizers are considered. For each fixed number of levels, MMsm decreases with 
entropy up to a certain point, after which there is no further decrease in mean square 
error. 
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Fig. 2—Quantizer performance for Laplacian density. 


4.0 


strained to be symmetric. Also, without loss of any generality, only 
quantizers having odd numbers of levels were considered. In each case, 
several problems were solved by varying the entropy constraint and 
the number of levels. The number of levels were varied from 3 to 21, 
and the entropy constraint was varied from 1.0 bit to the largest 


possible bits using a particular number of levels. 


Results of these simulations are given in Figs. 1 through 5. In these 
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Fig. 3—Quantizer performance for gaussian density. 
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Fig. 4—Quantizer performance for Laplacian density and exponential weighting. 
Quantization noise is weighted by 1/|8| exp(—8|z|), whereas the probability 
density is taken to be 1/|a| exp (—a|z|). Such situations arise in quantization of 
the prediction errors in predictive coding of the television signals: a = 0.1, 8 = 0.065. 


figures the distortion is plotted logarithmically on y-axis and the 
entropy is plotted linearly on z-axis in bits. Alternate solid and broken 
lines are shown for different values of quantizer levels. For a given 
number of levels, the minimum distortion decreases approximately 
exponentially with respect to the entropy up to a certain point and 
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Fig. 5—Quantizer performance for Laplacian density and exponential weighting: 
a = 0.18, 8 = 0.1. 


FIXED-POINT ALGORITHM ~ 1431 


then the entropy constraint is not operative any longer, and conse- 
quently the distortion remains a constant. These are indeed Lloyd- 
Max". quantizers that minimize the distortion for a given number of 
levels with no constraint on the entropy. The distortion versus entropy 
curves are lower bounded by the following functions: 


Example (1) D = 85.3 exp(—1.39E) 

Example (2) D = 196.18 exp(—1.32E) 

Example (3) D = 1.40 exp(—1.39£) (14) 
Example (4a) D = 62.50 exp(—1.31E) 
Example (4b) D = 176.71 exp(—1.24E). 


In the case of uniform densities, the optimum quantizer is non- 
uniform whenever the entropy constraint 1s operative, but when the 
entropy constraint is too large and inoperative, the optimum quantizers 
are uniform. Laplacian densities, on the other hand, always have uni- 
form quantizers as the optimum quantizers. This has been shown by 
Berger.”° In the case of gaussian density, the optimum quantizers were 
not uniform; however, a comparison of our results with those given by 
Goblick and Holsinger!’ indicates that, although nonuniform quantizers 
perform better than uniform quantizers, the differences in the per- 
formance of the two are somewhat small. This conclusion has also been 
reached by Wood?® and Berger.®° The case of an exponential weighting 
function falling slower than the probability density function arises in 
quantization of the prediction error in a simple element differential 
coding of picture signals. In this case, the density of the prediction 
error 1s approximately Laplacian, whereas the perceptual visibility®:* 
of the quantization noise may be approximated by an exponential 
function decaying somewhat slower than the probability density. The 
distortion-entropy curves for this case show larger improvement (that 
is, for a given entropy the distortion decreases much more than in the 
previous examples) as the number of levels is increased. Also, the 
optimum quantizers are nonuniform. Improvement in their perform- 
ance over that of the uniform quantizers is more significant than in the 
previous examples. It is interesting to note that our algorithm can 
solve Lloyd-Max problem trivially by sctting the entropy constraint 
to a very high value. This algorithm was also used in other applica- 
tions related to adaptive quantization*‘ of picture signals. The prob- 
lems in this case were such that they had uniform (constant) weighting 
functions and two-sided exponentials as the density functions. The 
resulting quantizers had interesting structure and were used quite 
successfully. 
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4.1 Computational effort as a function of n 

The increase in computational effort as a function of dimension, n, 
of the problem is important in the study of algorithms. In the case of 
fixed-point algorithms, Saigal?® had speculated that different tri- 
angulations would have different effects on this growth and he was the 
first to propose a measure to describe it. For the triangulation em- 
ployed in our experimentation, his measure predicted the growth rate 
of the number of iterations as n?. Subsequently, Todd* refined his 
measure to predict an “‘average’”’ growth rate of the iterations as n?. 
The measure of Saigal, in some sense, predicts the ‘‘worst case” 
behavior. 

The computational experiments in Section IV were ideally suited 
to test the theoretical predictions of Refs. 28 and 35, since the dimen- 
sion of the problem was increased in a regular manner, the starting 
points were chosen in a regular way, and the problems of dimensions 
varying between 1 and 10 were solved. A number of results for various 
entropy values were plotted on the log-log paper. A representative 
plot is given in Fig. 6. It is seen that the experimental points lie on a 
straight line. The slope of these lines for different cases was a function 
of the entropy constraint and the probability density used and varied 
from 1.55 to 1.88, which is between 1.5 predicted by Todd** and 2 
predicted by Saigal.?8 

Thus, we can conclude, with a high degree of certainty, that the 
number of iterations of the algorithm to solve a problem of dimension 
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Fig. 6—Growth of computations vs dimensionality. Number of representative 
levels N is two times the dimensionality n plus 1. Straight line drawn is the minimum 
mean square error fit to the observations shown by A. 
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n would require an® for some b between 1.5 and 2.0. Since each iteration 
requires O(n?) multiplications and at most one evaluation of the 
function, the number of function evaluations is bounded by O(n?) and 
multiplications by 0(n4). 


V. CONCLUSIONS 


A fixed-point formulation has been developed to minimize the dis- 
tortion, using a fairly general distortion measure, with respect to pa- 
rameters of a quantizer under an entropy constraint on the quantized 
output. A point-to-set mapping is first developed whose fixed point 
satisfies the necessary conditions for a local extremum. Then a com- 
puter program is developed to compute its fixed points. Several ex- 
amples are solved to show the usefulness of the algorithm. Finally, the 
rate of growth of the computations used by the algorithm as a function 
of the dimensionality of the problem is also discussed. 
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MATERIALS SCIENCE 


Anodic Passivation and Coating of AlAs in Aqueous Solutions. W. D. Johnston, Jr., 
J. Electrochem. Soc., 123 (March 1976), pp. 442-443. Uniform, stable oxide 
films have been grown on VPE deposited AlAs layers by anodization at constant 
current in pH 2.0 water. The oxide has refractive index ~1.8 and is suitable as a 
pee and anti-reflective coating for solar cells, LEDs, and other devices made 
rom Ss. 


Enhanced Solubility and Ion Pairing of Cu and Au in Heavily Doped Silicon at High 
Temperatures. R. L. Meek and T. E. Seidel, J. Phys. Chem. Solids, 36 (1975), pp. 
731-740. The equilibrium solubilities of Cu and Au in silicon have been calculated 
for high temperatures (900-1100°C) and heavy dopings (10'°-102! em~?) and are 
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compared with experimental results for uniformly bulk doped and diffusion doped 
material. For strongly extrinsic n type material, a large solubility enhancement 
(about 103 times the intrinsic solubility) is calculated, due to ion pairing of the sub- 
stitutional metal acceptor with donors. The saturation metal solubilities observed in 
bulk samples and diffused layers are in substantial agreement (within a factor of 
~2) with calculations for all temperatures and doping levels. 


Melting Point Depression and Kinetic Effects of Cooling on Crystallization in Poly- 
(Vinylidene Fluoride)—Poly(Methyl Methacrylate) Mixtures. T. Nishi* and T. 
T. Wang, Macromolecules, 8 (November 1975), pp. 909-915. Melting point 
depression has been observed in mixtures of poly(vinylidene fluoride) and poly- 
(methyl methacrylate). The phenomenon is ascribed to thermodynamic mixing of 
two compatible polymers. An appropriate expression is derived from which the 
interaction parameter for the polymer pair was found to be —0.295 at 160°C. 
* Bridgestone Tire Co., Ltd., Tokyo, Japan. 


Probability of Static Fatigue Failure In Optical Fibers. D. Kalish and B. K. Tariyal, 
Appl. Phys. Lett., 28 (June 15, 1976), pp. 721-723. An expression for the prob- 
ability of static fatigue failure in glass is developed based upon a Weibull-type 
cumulative strength distribution and a fracture mechanics slow crack growth law. 
Good agreement between the model and experimental results is demonstrated. Ex- 
amples for using this model as a predictive tool are presented. 


Sputtering Techniques and Applications. A. K. Sinha, Electron. Packag. Prod., 16, 
No. 8 (October 1975), pp. V10-V14. Important concepts are stated that should 
help understand and utilize sputtering technique for thin-film deposition. A state-of- 
the-art description is given of commercially available sputtering variants; namely, 
rf-diode sputtering, dc-diode sputtering, triode sputtering, and the magnetron sputter 
source. Effect of deposition variables is described on film composition, microstructure, 
and properties. 


PHYSICS 


Concentration-Dependent Absorption and Spontaneous Emission of Heavily Doped 
GaAs. H. C. Casey, Jr., and F. Stern, J. Appl. Phys., 47 (February 1976), pp. 631-— 
643. A model for the calculation of the absorption and emission spectra for GaAs 
at carrier concentrations in excess of 1 X 10!8 em7* is described. Calculated absorption 
and emission spectra are compared to previous experimental results, which permits 
assignment of the concentration dependence of the energy gap. The concentration- 
dependent thermal equilibrium electron-hole density product and radiative lifetime 
are calculated for p-type GaAs. 


Determination of the Stress in Optical Fibers by Means of a Polariscope. M. J. 
Saunders, Rev. Sci. Instrum., 47 (April 1976), pp. 496-500. A polariscope is 
used in conjunction with a diameter-measuring instrument to determine the relation- 
ship between the tension of a Vycor-clad quartz fiber, as it is being drawn, and the 
diameter of the fiber. The polariscope is also used to determine the stress optical 
coefficient of optical fibers and preforms. 


The Effects of Soft Modes on the Structure and Properties of Materials. P. A. 
Fleury, Ann. Rev. Mat. Sci., 6 (1976), pp. 157-180. This paper reviews recent 
developments regarding the role of crystal lattice instabilities (soft modes) in struc- 
tural phase transitions and the associated enhanced or anomalous physical properties 
of materials. Material systems considered include ferroelectrics, superconductors, 
and metastable alloys. Phenomena considered include light scattering, critical fluctua- 
nom, a mode interactions, etc. Device applications of soft mode effects are 
iscussed. 


Metal-Induced Extrinsic Surface States on Si, Ge, and GaAs. J. E. Rowe, J. Vacuum 
Sci. Technol., 13 (January/February 1976), pp. 248-250. Evidence for extrinsic 
metal-induced empty surface states during the Schottky-barrier formation on 
Si(111), GaAs(111), Ge(111), and Ge(100) is obtained with electron-energy loss 
spectroscopy. UV photoemission spectroscopy provides similar evidence for occupied 
extrinsic states on Si(111). The anomalous results on (110) surfaces are discussed in 
terms of a simple structural model. 
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Optically-Induced Energy Level Shifts for Intermediate Intensities. P. F. Liao and 
J. E. Bjorkholm, Opt. Commun., 16, No. 3 (March 1976), pp. 392-395. We 
report measurements of optically-induced energy level shifts by nonresonant light 
at intermediate intensities in atomic sodium vapor. The intensity-dependence of the 
shifts departs substantially from the linear behavior predicted by second order 
perturbation theory, but is in good agreement with more exact calculations and yields 
0.10 for the 3P;-4D oscillator strength. 


Plasmon-Interband Coupling in Gallium Compounds. J. E. Rowe, J. C. Tracy, and 
S. B. Christman, Surface Sci., 52 (1976), pp. 277-284. Electron energy loss 
spectra have been measured for metallic Ga and the semiconductor compounds 
GaSb, GaAs, GaP, and GaN. The intensity at the measured plasmon energy decreases 
with increasing ionicity of the compound semiconductor. This is explained by a 
simple model involving a coupling of plasmon oscillations and interband transitions. 


Production of Stabilized Coloration in Alkali Halides by a Two-Photon Absorption 
Process. L. F. Mollenauer, G. C. Bjorklund, and W. J. Tomlinson, Phys. Rev. Lett., 
85, No. 24 (December 15, 1975), pp. 1662~1665. We have measured the stabilized 
coloration produced from the conversion of U centers in KCl by means of a two- 
photon process. The measurements show that the coloration involves energy transfer 
via electron-hole pairs, rather than direct photoexcitation of the U centers. The 
absolute two-photon absorption cross section of KCl at \ = 266 nm was determined. 


Pyroelectric Sign of LilO;. E. H. Turner, J. Appl. Crystallog., 9 (February 1976), 
p. 52. The sign of the total pyroelectric coefficient, p3 of LiIO; is related to the 
absolute configuration. Using the piezoelectric sign convention, p3 is found to be 
positive. 


SYSTEMS ENGINEERING AND OPERATIONAL RESEARCH 


On the Output of a GI/M/N Queuing System with Interrupted Poisson Input. 
H. Heffes, Oper. Res., 24 (May-June 1976), pp. 530-542. We characterize the 
departure process as a semi-Markov process and give results for the joint distribution 
of the number of customers in the system and the state of the input process at service 
completions. We also present results relating to the interdeparture time distribution 
and the distribution of the nonbusy period and compare the results with some known 
results for single-server systems. 
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